README

###########################################
DESCRIPTION

These files contain the data and the research script for analyzing the use of ChatGPT in dental research writing from 2018 to 2024. The dataset comprises CSV files with publication details and the analysis script used to investigate trends and patterns in the use of specific signaling words associated with ChatGPT.

###########################################
KEYWORDS

ChatGPT, Dental Research, Large Language Models, Text Analysis, Scientific Writing, Natural Language Processing

###########################################
CONTACT

Sergio E. Uribe, sergio.uribe@rsu.lv

###########################################
CREATION DATE (yyyy-mm-dd)

2024-06-06

###########################################
GRANT/FUNDING ASSOCIATED

SEU and IM acknowledge financial support from The Latvian Council of Science, project No lzp-2022/1-0047, “IEVA Project.” SEU acknowledges financial support from the European Union’s Horizon 2020 research and innovation program under grant agreement No 857287 for the Baltic Biomaterials Centre of Excellence.

###########################################
FILES

01_EDA_chatgpt_writing.qmd
    The exploratory data analysis (EDA) script in Quarto markdown format. Run it with Quarto to perform the initial data exploration and visualization.

README ChatGPT for Dental Research Writing
    This text file, containing detailed instructions and metadata for the research dataset and analysis.

dental_or_dentistry.csv
    Dental publications retrieved from PubMed with the terms "dental" or "dentistry".

signaling.csv
    Signaling words identified in the abstracts of dental publications. These words are associated with the use of ChatGPT in writing.

df_2018_2024.csv
    Dental publications from 2018 to 2024, with metadata fields such as publication dates and signaling word frequencies.

pubmed_20_24_details.csv
    Detailed information on dental publications indexed in PubMed from 2020 to 2024.

eng_vs_nig_ken.csv
    Comparison of publications from English-speaking countries (UK, USA, Australia) with publications from Nigeria and Kenya, focusing on the frequency of signaling words.

###########################################
PREREQUISITES TO REPLICATE THE ANALYSIS

To recreate the analysis, you need to:

1. Install Quarto and the necessary R packages (see SOFTWARE AND PACKAGES USED).
2. Create a folder called "Figs" where the figures will be saved.

###########################################
SOFTWARE AND PACKAGES USED

> sessionInfo()
R version 4.4.0 (2024-04-24)
Platform: x86_64-pc-linux-gnu
Running under: Pop!_OS 22.04 LTS

Matrix products: default
BLAS:   /usr/lib/x86_64-linux-gnu/blas/libblas.so.3.10.0
LAPACK: /usr/lib/x86_64-linux-gnu/lapack/liblapack.so.3.10.0

locale:
 [1] LC_CTYPE=en_GB.UTF-8 LC_NUMERIC=C LC_TIME=en_US.UTF-8 LC_COLLATE=en_GB.UTF-8 LC_MONETARY=en_US.UTF-8
 [6] LC_MESSAGES=en_GB.UTF-8 LC_PAPER=en_US.UTF-8 LC_NAME=C LC_ADDRESS=C LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C

time zone: Europe/Riga
tzcode source: system (glibc)

attached base packages:
[1] stats graphics grDevices utils datasets methods base

other attached packages:
 [1] scales_1.3.0 kableExtra_1.4.0 viridis_0.6.5 viridisLite_0.4.2 patchwork_1.2.0 ggrepel_0.9.5 here_1.0.1
 [8] janitor_2.2.0 lubridate_1.9.3 forcats_1.0.0 stringr_1.5.1 dplyr_1.1.4 purrr_1.0.2 readr_2.1.5
[15] tidyr_1.3.1 tibble_3.2.1 ggplot2_3.5.1 tidyverse_2.0.0

loaded via a namespace (and not attached):
 [1] gtable_0.3.5 xfun_0.44 bslib_0.7.0 tzdb_0.4.0 vctrs_0.6.5 tools_4.4.0 generics_0.1.3
 [8] parallel_4.4.0 fansi_1.0.6 highr_0.10 pacman_0.5.1 pkgconfig_2.0.3 lifecycle_1.0.4 farver_2.1.2
[15] compiler_4.4.0 textshaping_0.3.7 munsell_0.5.1 snakecase_0.11.1 htmltools_0.5.8.1 sass_0.4.9 pillar_1.9.0
[22] crayon_1.5.2 jquerylib_0.1.4 cachem_1.1.0 tidyselect_1.2.1 digest_0.6.35 stringi_1.8.4 labeling_0.4.3
[29] rprojroot_2.0.4 fastmap_1.2.0 grid_4.4.0 colorspace_2.1-0 cli_3.6.2 magrittr_2.0.3 utf8_1.2.4
[36] withr_3.0.0 bit64_4.0.5 timechange_0.3.0 rmarkdown_2.26 bit_4.0.5 gridExtra_2.3 ragg_1.3.2
[43] hms_1.1.3 evaluate_0.23 knitr_1.46 rlang_1.1.3 Rcpp_1.0.12 glue_1.7.0 xml2_1.3.6
[50] svglite_2.1.3 rstudioapi_0.16.0 vroom_1.6.5 jsonlite_1.8.8 R6_2.5.1 systemfonts_1.1.0
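
###########################################
EXAMPLE SETUP (SKETCH)

The two prerequisite steps (installing the R packages and creating the "Figs" folder) could be scripted roughly as below. This is a minimal sketch, not part of the original analysis: the package list is an assumption derived from the "other attached packages" in the sessionInfo() output plus knitr and rmarkdown, which Quarto needs to run R code; the CRAN mirror URL and the variable names are illustrative; and the script itself may load its packages differently. The sketch assumes it is run from the folder that holds the files listed under FILES.

```r
# Packages assumed to be needed, taken from "other attached packages" above.
# tidyverse brings in ggplot2, dplyr, tidyr, readr, purrr, tibble, stringr,
# forcats and lubridate; viridisLite is installed as a dependency of viridis.
required <- c("tidyverse", "scales", "kableExtra", "viridis", "patchwork",
              "ggrepel", "here", "janitor",
              "knitr", "rmarkdown")  # used by Quarto to execute R chunks

# Install only the packages that are not already in the local library.
# The mirror is set explicitly so this also works in a non-interactive session.
to_install <- setdiff(required, rownames(installed.packages()))
if (length(to_install) > 0) {
  install.packages(to_install, repos = "https://cloud.r-project.org")
}

# Create the folder where the figures will be saved (no warning if it exists).
dir.create("Figs", showWarnings = FALSE)

# Report which of the data files named in this README are present.
data_files <- c("dental_or_dentistry.csv", "signaling.csv", "df_2018_2024.csv",
                "pubmed_20_24_details.csv", "eng_vs_nig_ken.csv")
files_found <- file.exists(data_files)
names(files_found) <- data_files
print(files_found)
```

With Quarto installed, the analysis can then be rendered from a terminal with: quarto render 01_EDA_chatgpt_writing.qmd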