The release of six corpora (1.3 Million tokens) with full morphological annotations for (Palestinian, Lebanese, Yemeni, Iraqi, Libyan, and Sudanese) dialects. All are annotated using the LDC’s SAMA tagsets.
Search: https://portal.sina.birzeit.edu/curras
Download: https://portal.sina.birzeit.edu/curras/about-en.html
Search: https://portal.sina.birzeit.edu/curras
Download: https://portal.sina.birzeit.edu/curras/about-en.html