{ "access": { "embargo": { "active": false, "reason": null }, "files": "public", "record": "public", "status": "open" }, "created": "2022-08-03T11:33:03.153003+00:00", "custom_fields": { "meeting:meeting": { "acronym": "STI 2022", "dates": "7-9 September 2022", "place": "Granada, Spain", "title": "26th International Conference on Science, Technology and Innovation Indicators", "url": "https://sti2022.org/" } }, "deletion_status": { "is_deleted": false, "status": "P" }, "files": { "count": 1, "enabled": true, "entries": { "238.pdf": { "checksum": "md5:35d4988f5f4904690817d45727642333", "ext": "pdf", "id": "603af41e-e444-4cb9-88ab-ec7f7472ad88", "key": "238.pdf", "metadata": null, "mimetype": "application/pdf", "size": 547040 } }, "order": [], "total_bytes": 547040 }, "id": "6957616", "is_draft": false, "is_published": true, "links": { "access": "https://zenodo.org/api/records/6957616/access", "access_links": "https://zenodo.org/api/records/6957616/access/links", "access_request": "https://zenodo.org/api/records/6957616/access/request", "access_users": "https://zenodo.org/api/records/6957616/access/users", "archive": "https://zenodo.org/api/records/6957616/files-archive", "archive_media": "https://zenodo.org/api/records/6957616/media-files-archive", "communities": "https://zenodo.org/api/records/6957616/communities", "communities-suggestions": "https://zenodo.org/api/records/6957616/communities-suggestions", "doi": "https://doi.org/10.5281/zenodo.6957616", "draft": "https://zenodo.org/api/records/6957616/draft", "files": "https://zenodo.org/api/records/6957616/files", "latest": "https://zenodo.org/api/records/6957616/versions/latest", "latest_html": "https://zenodo.org/records/6957616/latest", "media_files": "https://zenodo.org/api/records/6957616/media-files", "parent": "https://zenodo.org/api/records/6957615", "parent_doi": "https://zenodo.org/doi/10.5281/zenodo.6957615", "parent_html": "https://zenodo.org/records/6957615", "requests": "https://zenodo.org/api/records/6957616/requests", "reserve_doi": "https://zenodo.org/api/records/6957616/draft/pids/doi", "self": "https://zenodo.org/api/records/6957616", "self_doi": "https://zenodo.org/doi/10.5281/zenodo.6957616", "self_html": "https://zenodo.org/records/6957616", "self_iiif_manifest": "https://zenodo.org/api/iiif/record:6957616/manifest", "self_iiif_sequence": "https://zenodo.org/api/iiif/record:6957616/sequence/default", "versions": "https://zenodo.org/api/records/6957616/versions" }, "media_files": { "count": 0, "enabled": false, "entries": {}, "order": [], "total_bytes": 0 }, "metadata": { "creators": [ { "affiliations": [ { "name": "University of Campinas" } ], "person_or_org": { "family_name": "Mazoni", "given_name": "Alysson", "name": "Mazoni, Alysson", "type": "personal" } }, { "affiliations": [ { "name": "Universidad de Granada" } ], "person_or_org": { "family_name": "Arroyo-Machado", "given_name": "Wenceslao", "name": "Arroyo-Machado, Wenceslao", "type": "personal" } }, { "affiliations": [ { "name": "Leiden University" } ], "person_or_org": { "family_name": "Traag", "given_name": "Vincent A.", "name": "Traag, Vincent A.", "type": "personal" } }, { "affiliations": [ { "name": "Leiden University" } ], "person_or_org": { "family_name": "Costas", "given_name": "Rodrigo", "name": "Costas, Rodrigo", "type": "personal" } } ], "description": "
In this study we provide a first account of the methodological workflow for a large-scale clustering of the Twitter communities of attention around scientific publications as captured in the open database Crossref Event Data. To the best of our knowledge this is the largest algorithmic clustering of Twitter users and scientific publications performed to date. The availability of this type of clustering opens new analytical possibilities in the study of the Twitter dissemination of scientific publications. For example, making possible the study of the diversity of the communities in which publications have been tweeted enabling the differentiation of publications tweeted in smaller or larger communities, or the identification of those communities that tweet more superficially or automatically.
\n\nFrom a technical point of view, the use of big data tools (Google BigQuery) was implemented given the large size of data involved in the clustering. Moreover, the use of the relative weight allowed for the determination of well connected communities without much skewness in its sizes. The sheer size and availability of open data opens the way for several kinds of analysis that demand careful use of file formats and computation resources, usually based on big data tools such as data warehouses and running multiprocessor code.
\n\nFuture research will necessarily focus on two additional developments: 1) refining the clustering to include those less connected communities in a meaningful manner, making them also more balanced, and 2) implementing a labelling of the different clusters obtained. For the first, additional clustering (e.g. clustering of clusters) and reclustering of smaller clusters will be very likely the approach to go. For the second, we aim at finding potentially meaningful information by collecting metadata from papers (e.g. journals, titles, topics) and Twitter users (e.g. profile descriptions, geolocations, URLs). That information, combined with language processing techniques will potentially allow the labelling of the clusters in order to better characterise the communities and their dynamics in disseminating scientific publications on Twitter.
", "languages": [ { "id": "eng", "title": { "en": "English" } } ], "publication_date": "2022-09-07", "publisher": "Zenodo", "related_identifiers": [ { "identifier": "10.5281/zenodo.7129562", "relation_type": { "id": "isdescribedby", "title": { "de": "Wird beschrieben von", "en": "Is described by" } }, "resource_type": { "id": "presentation", "title": { "de": "Pr\u00e4sentation", "en": "Presentation" } }, "scheme": "doi" } ], "resource_type": { "id": "publication-conferencepaper", "title": { "de": "Konferenzbeitrag", "en": "Conference paper" } }, "rights": [ { "description": { "en": "The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited." }, "icon": "cc-by-icon", "id": "cc-by-4.0", "props": { "scheme": "spdx", "url": "https://creativecommons.org/licenses/by/4.0/legalcode" }, "title": { "en": "Creative Commons Attribution 4.0 International" } } ], "title": "On the quest of scholarly communities of attention: large-scale clustering of Twitter users around scientific publications" }, "parent": { "access": { "owned_by": { "user": 382159 } }, "communities": { "default": "038368a7-eeb9-41df-9c2e-8191082be9da", "entries": [ { "access": { "member_policy": "open", "members_visibility": "public", "record_policy": "open", "review_policy": "open", "visibility": "public" }, "children": { "allow": false }, "created": "2022-07-07T08:50:29.744583+00:00", "custom_fields": {}, "deletion_status": { "is_deleted": false, "status": "P" }, "id": "038368a7-eeb9-41df-9c2e-8191082be9da", "links": {}, "metadata": { "curation_policy": "All accepted submissions (oral presentations and poster) to the conference which have been subjected to a peer review process.
\r\n", "page": "Proceedings of the Science, Technology and Innovation Indicators Conference held in Granada (Spain) between the 7th and 9th of September of 2022.
\r\n\r\nThe STI Conference is an annual event organized by the European Network of Indicator Designers (ENID). This is the 26th edition of the STI conference series, initiated in 1988 by the Centre of Science and Technology Studies (CWTS) at the University of Leiden. It also happens to be the 1st in-person edition since the COVID-19 pandemic.
", "title": "STI 2022 \"From Global Indicators to Local Applications\" Conference Proceedings" }, "revision_id": 0, "slug": "sti2022grx", "updated": "2022-07-07T08:50:29.868881+00:00" } ], "ids": [ "038368a7-eeb9-41df-9c2e-8191082be9da" ] }, "id": "6957615", "pids": { "doi": { "client": "datacite", "identifier": "10.5281/zenodo.6957615", "provider": "datacite" } } }, "pids": { "doi": { "client": "datacite", "identifier": "10.5281/zenodo.6957616", "provider": "datacite" }, "oai": { "identifier": "oai:zenodo.org:6957616", "provider": "oai" } }, "revision_id": 5, "stats": { "all_versions": { "data_volume": 40480960.0, "downloads": 74, "unique_downloads": 72, "unique_views": 107, "views": 121 }, "this_version": { "data_volume": 40480960.0, "downloads": 74, "unique_downloads": 72, "unique_views": 107, "views": 121 } }, "status": "published", "updated": "2022-09-30T14:38:45.898809+00:00", "versions": { "index": 1, "is_latest": true } }