UrbanOccupationsOETR_hdr_Nicaea_6k is the first historical Arabic handwritten digit dataset. It is curated from the first series of Ottoman population registers conducted in the mid-nineteenth century. The dataset was controlled manually and cleaned. It has more than 6000 digits. Five thousand images are divided into the training folder, and the remaining 1000 images are divided into the test folder. Please cite the below paper in your publications if you use the dataset:
Can, Yekta S., and M. Erdem Kabadayı. “Curation of Historical Arabic Handwritten Digit Datasets from Ottoman Population Registers: A Deep Transfer Learning Case Study.” In 2020 IEEE International Conference on Big Data (Big Data), 1853–60, 2020. https://doi.org/10.1109/BigData50022.2020.9378445.