We made the following observations during the analysis:

- In some Chinese projects (e.g. alibaba/ice), visually equivalent variants of common characters appeared, such as full-width punctuation.
  For example, the full-width colon "：" was used instead of the ASCII colon ":". In some cases, GFM
  automatically replaced these characters with a question mark ("?"); in others, the string was broken.
  We manually fixed each corrupted paragraph.
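
As an illustration only (the fixes described above were made by hand), the following minimal sketch shows one way such characters could be flagged before rendering. It assumes the problematic characters are full-width forms, like the "：" example. The helper name `find_fullwidth` is hypothetical and not part of our tooling.

```python
import unicodedata


def find_fullwidth(text):
    """Return (index, char, nfkc_form) for each full-width character in text.

    Hypothetical helper: a character counts as full-width when its Unicode
    East Asian Width property is "F". NFKC normalization maps it to its
    compatibility equivalent (e.g. "：" becomes ":").
    """
    hits = []
    for i, ch in enumerate(text):
        if unicodedata.east_asian_width(ch) == "F":
            hits.append((i, ch, unicodedata.normalize("NFKC", ch)))
    return hits


sample = "注意：安装依赖"
print(find_fullwidth(sample))  # [(2, '：', ':')]
```

Ordinary Chinese characters have East Asian Width "W" rather than "F", so they are not reported. Only the full-width punctuation is flagged for review.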
