How I Created 33,000 Arabic Lexemes in Wikidata


Diff · The winners of the 2025 Wiki Loves Monuments photo contest · 29 April 2026 by Wikimedia Foundation · about 1 month ago


#lexemes #wikidata #primary #masthead #arabic #project


In a world driven by semantic search, artificial intelligence, and structured data, language is no longer merely a means of communication; it has become a data layer on which modern systems are built. Arabic, despite its rich morphological and derivational nature, has remained significantly underrepresented in Wikidata. Before this project began, there were no more than 2,500 Arabic lexemes, and many of them lacked essential morphological and derivational data. Roots were often unlinked, patterns incomplete, verbs without forms, and lexemes without a clear ontological framework. From that starting point, I worked on the “Arabic Lexeme Enrichment” project from August to December 2025, not merely as a partial improvement effort but as a practical attempt to build a coherent Arabic linguistic layer within the platform, one that Wiktionary and other systems can later rely on.…
