Published February 15, 2019 | Version 1.0
Dataset Open

Database of Russian names, surnames and midnames for gender identification

  • 1. Infoculture

Description

Database of names, surnames and midnames across the Russian federation used as source to teach algorithms for gender identification by fullname.

Dataset prepared for MongoDB database. It has MongoDB dump and dump of tables as JSON lines files.

Used in gender identification and fullname parsing software https://github.com/datacoon/russiannames

Available under Creative Commons CC-BY SA by default.

Files

russiannames_db_bson.zip

Files (14.1 MB)

Name Size Download all
md5:f11f7608eaa2a82d3dae98530c9f26b4
8.1 MB Preview Download
md5:10b4bf03e1eea33f72d4284fd2a582b9
6.0 MB Preview Download