Toward Automatic Typography Analysis: Serif Classification and Font Similarities
Creators
- 1. Computer Vision Lab, Ecole Polytechnique fédérale de Lausanne, Switzerland and Department of Computer Vision, Mohamed bin Zayed University of Artificial Intelligence
- 2. EPFL+ECAL Lab, Ecole Polytechnique fédérale de Lausanne, Switzerland
- 3. Computer Vision Lab, Ecole Polytechnique fédérale de Lausanne, Switzerland
Description
Whether a document is of historical or contemporary significance, typography plays a crucial role in its composition. From the early days of modern printing, typographic techniques have evolved and transformed, resulting in changes to the features of typography. By analyzing these features, we can gain insights into specific time periods, geographical locations, and messages conveyed through typography. Therefore, in this paper, we aim to investigate the feasibility of training a model to classify serif types
without knowledge of the font and character. We also investigate how to train a vectorial-based image model able to group together fonts with similar features. Specifically, we compare the use of state-of-theart image classification methods, such as the EfficientNet-B2 and the Vision Transformer Base model with different patch sizes, and the state-of-the-art fine-grained image classification method, TransFG, on the serif classification task. We also evaluate the use of the DeepSVG model to learn to group fonts with similar features. Our investigation reveals that fine-grained image classification methods are better suited for the serif classification tasks and that leveraging the character labels helps to learn more meaningful font similarities.
This repository contains:
- Paper published in the Journal of data mining and digital humanities:
WasimEtAl_Toward_Automatic_Typography_Analysis__Serif_Classification_and_Font_Similarities.pdf
- Two datasets: The first for serif classification
consisting of 126666 training and 2914 font-independent testing images in raster format.
The second dataset for svg based similarity learning consists of 124010 training and 2914
font-independent testing images. The images have been categorized into sans-serif,
linear-serif, slab-serif, and triangular serif by a designer at the EPFL+ECAL Lab
font_serif_dataset.zip
font_svg_dataset.zip
- model weights:
serif_B_16_long_checkpoint.bin: refers to TransFG ViT B/16 Backend
serif_B_32_long_checkpoint.bin: refers to TransFG ViT B/32 Backend
ViT-B_16.npz ViT-B_32.npz are publicly available at:
https://github.com/TACJu/TransFG
and were used for training TransFG ViT B/16 Backend and TransFG ViT B/32 Backend
- code:
https://github.com/TalalWasim/GEST-Serif
Files
font_serif_dataset.zip
Files
(1.8 GB)
Name | Size | Download all |
---|---|---|
md5:f87971f593457b7da225d313b44354d9
|
192.7 MB | Preview Download |
md5:13ef7c1d68cd9e4e33c68542ba691f3a
|
74.0 MB | Preview Download |
md5:343b6a033f7a7fa912fbf69c02818de3
|
1.1 kB | Preview Download |
md5:f5a53cf74b003c90e409b9a765679538
|
346.9 MB | Download |
md5:82bfc61f48cfa6a23d4a5455e5d29f87
|
353.5 MB | Download |
md5:d9715d3bf39e807337a3c3e117756738
|
412.8 MB | Download |
md5:9f43c05aac4a905b3ca9bdac814d4c32
|
419.4 MB | Download |
md5:2fa9a6f625403d7f673092cc6a9f1ed9
|
5.0 MB | Preview Download |