Publicly available medical text data with authentic quality
Description
This dataset is the public medical text record (progress notes) written in Japanese.
Any researchers can use this dataset without privacy issues.
CC BY-NC 4.0
crowd.zip: 9,756 pseudo progress notes written by crowd workers
crowd_evaluated.zip: 83 pseudo progress notes with authentic quality written by crowd workers
MD.zip: 19 pseudo progress notes written by medical doctors
Reference:
Kagawa, R., Baba, Y., & Tsurushima, H. (2021, December). A practical and universal framework for generating publicly available medical notes of authentic quality via the power of crowds. In 2021 IEEE International Conference on Big Data (Big Data) (pp. 3534-3543). IEEE.
http://hdl.handle.net/2241/0002002333
The supplemental files of the paper are here: https://github.com/rinabouk/HMData2021
Files
crowd.zip
Additional details
References
- Kagawa, R., Baba, Y., & Tsurushima, H. (2021, December). A practical and universal framework for generating publicly available medical notes of authentic quality via the power of crowds. In 2021 IEEE International Conference on Big Data (Big Data) (pp. 3534-3543). IEEE. http://hdl.handle.net/2241/0002002333