Table Lemma List - warwickfoster/qurantools GitHub Wiki
The `lemma-list` table contains base forms of words (lemmas) found in the Qur'an. It acts as a lexical reference for scholars analyzing word usage, morphology, and semantics in the text.
lemma-list Table
Below is a field-by-field description of the `lemma-list` table.
Table Name | Field Name | Description |
---|---|---|
lemma-list | LEMMA ID | A unique identifier for each lemma entry, serving as the primary key for the table. |
lemma-list | ENGLISH | The English transliteration of the lemma, providing a readable representation for non-Arabic speakers. |
lemma-list | ENGLISH-BINARY | A binary-encoded representation of the English transliteration, primarily for computational purposes. |
lemma-list | ARABIC | The lemma in Arabic script, capturing its original orthographic representation. |
lemma-list | ARABIC ALTERNATE RENDERING | An alternative rendering of the Arabic lemma, reflecting variations or orthographic differences. |
lemma-list | ROOT | The linguistic root associated with the lemma, linking it to its morphological base. |
lemma-list | ROOT-BINARY | A binary-encoded representation of the root for computational analysis. |
lemma-list | COUNT | The total number of occurrences of the lemma in the Qur'an. |
lemma-list | COUNT MECCAN | The number of times the lemma appears in Meccan chapters. |
lemma-list | COUNT MEDINAN | The number of times the lemma appears in Medinan chapters. |
lemma-list | ENGLISH TRANSLITERATED | A phonetic representation of the lemma in Roman script, aiding in pronunciation for non-Arabic speakers. |
lemma-list | 100-COUNT-ALL | The normalized frequency of the lemma per 100 words across the entire Qur'an. |
lemma-list | 100-COUNT-MECCAN | The normalized frequency of the lemma per 100 words in Meccan chapters. |
lemma-list | 100-COUNT-MEDINAN | The normalized frequency of the lemma per 100 words in Medinan chapters. |
lemma-list | GLOSS | A concise English explanation or meaning of the lemma, providing its semantic context. |
lemma-list | PROPER NOUN | A flag indicating whether the lemma is a proper noun (e.g., a name or place). |
lemma-list | CORRECTED TRANSLITERATION | A corrected transliteration of the lemma if adjustments were necessary for accuracy or consistency. |
lemma-list | ALTERNATIVE TRANSLITERATION | An alternative transliteration of the lemma, reflecting regional or scholarly differences. |
lemma-list | LEMMA FIX NOT NEEDED | A flag indicating whether the lemma required no corrections during data processing. |
lemma-list | LEMMA FIXED BY USER | A flag showing whether the lemma's transliteration or other attributes were corrected manually by a user. |
lemma-list | AJ FOREIGN PAGE | A reference to additional external sources or pages where the lemma may be further described, such as in lexicons or concordances. |
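
Several field names contain spaces or hyphens (`COUNT MECCAN`, `100-COUNT-ALL`), as does the table name itself, so they have to be quoted as identifiers in SQL. The sketch below uses an in-memory SQLite database as a stand-in (the storage engine behind the real table is not stated on this page) and loads a few values copied from the sample rows further down. The helper names `build_demo_db`, `inconsistent_counts`, and `medinan_heavy` are hypothetical. It also checks a pattern that holds in the sample rows but is not documented as a rule: `COUNT` equals `COUNT MECCAN` plus `COUNT MEDINAN`.

```python
import sqlite3

# Values copied from sample rows 1, 3 and 4 on this page:
# (LEMMA ID, ENGLISH, COUNT, COUNT MECCAN, COUNT MEDINAN)
SAMPLE_ROWS = [
    (1, "$a>on", 4, 3, 1),
    (3, "$aA^'a", 236, 160, 76),
    (4, "$aA^q~u", 7, 1, 6),
]


def build_demo_db():
    """Create an in-memory SQLite stand-in holding a few lemma-list columns."""
    conn = sqlite3.connect(":memory:")
    # Double quotes are the standard SQL identifier quote.
    # MySQL would use backticks by default instead.
    conn.execute(
        'CREATE TABLE "lemma-list" ('
        '"LEMMA ID" INTEGER PRIMARY KEY, '
        '"ENGLISH" TEXT, '
        '"COUNT" INTEGER, '
        '"COUNT MECCAN" INTEGER, '
        '"COUNT MEDINAN" INTEGER)'
    )
    conn.executemany('INSERT INTO "lemma-list" VALUES (?, ?, ?, ?, ?)', SAMPLE_ROWS)
    return conn


def inconsistent_counts(conn):
    """Return LEMMA IDs whose COUNT differs from COUNT MECCAN + COUNT MEDINAN."""
    cur = conn.execute(
        'SELECT "LEMMA ID" FROM "lemma-list" '
        'WHERE "COUNT" <> "COUNT MECCAN" + "COUNT MEDINAN" '
        'ORDER BY "LEMMA ID"'
    )
    return [row[0] for row in cur.fetchall()]


def medinan_heavy(conn):
    """Return ENGLISH values of lemmas that occur more often in Medinan chapters."""
    cur = conn.execute(
        'SELECT "ENGLISH" FROM "lemma-list" '
        'WHERE "COUNT MEDINAN" > "COUNT MECCAN" '
        'ORDER BY "LEMMA ID"'
    )
    return [row[0] for row in cur.fetchall()]
```

Calling `inconsistent_counts(build_demo_db())` returns an empty list for these three rows; a non-empty result on the full table would point at rows worth reviewing.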
Key Insights
- Linguistic Focus:
  - The table connects lemmas to their roots (`ROOT`) and provides phonetic, orthographic, and semantic details for in-depth linguistic analysis.
  - Frequency fields (`COUNT`, `100-COUNT-ALL`, etc.) enable statistical studies of lemma distribution in the Qur'an.
- Applications:
  - Supports linguistic studies by linking lemmas to their roots and providing contextual meanings.
  - Facilitates translation work with accurate glosses and transliterations.
- Error Tracking:
  - The `CORRECTED TRANSLITERATION`, `LEMMA FIX NOT NEEDED`, and `LEMMA FIXED BY USER` fields ensure data accuracy and transparency in lemma processing.
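
The `100-COUNT-*` fields are described as frequencies per 100 words, which suggests the formula `count * 100 / total_words`. The word totals themselves are not given on this page, so the sketch below backs them out of one sample row (lemma ID 3) and then checks them against another (lemma ID 5). The function names `implied_total` and `per_100` are hypothetical, and the inferred totals are estimates from rounded figures, not documented constants.

```python
# Values copied from the sample rows on this page:
# (COUNT, COUNT MECCAN, COUNT MEDINAN,
#  100-COUNT-ALL, 100-COUNT-MECCAN, 100-COUNT-MEDINAN)
ROW_3 = (236, 160, 76, 0.316337, 0.340433, 0.275312)    # lemma ID 3
ROW_5 = (21, 14, 7, 0.0281486, 0.0297879, 0.0253577)    # lemma ID 5


def implied_total(count, rate_per_100):
    """Estimate the word total that turns `count` into `rate_per_100`."""
    return round(count * 100 / rate_per_100)


def per_100(count, total_words):
    """Normalized frequency: occurrences per 100 words."""
    return count * 100 / total_words


# Infer (all, Meccan, Medinan) word totals from row 3 ...
totals = tuple(implied_total(c, r) for c, r in zip(ROW_3[:3], ROW_3[3:]))

# ... and recompute the three frequencies of row 5 from those totals.
recomputed = tuple(per_100(c, t) for c, t in zip(ROW_5[:3], totals))

# Largest gap between recomputed and stored frequencies for row 5.
max_error = max(abs(a - b) for a, b in zip(recomputed, ROW_5[3:]))
```

With these two rows the inferred Meccan and Medinan totals add up to the inferred overall total, and `max_error` stays below one millionth, which is consistent with the per-100-words reading of the field descriptions.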
First 10 Rows Example
LEMMA ID | ENGLISH | ENGLISH-BINARY | ARABIC | ARABIC ALTERNATE RENDERING | ROOT | ROOT-BINARY | COUNT | COUNT MECCAN | COUNT MEDINAN | ENGLISH TRANSLITERATED | 100-COUNT-ALL | 100-COUNT-MECCAN | 100-COUNT-MEDINAN | GLOSS | PROPER NOUN | CORRECTED TRANSLITERATION | ALTERNATIVE TRANSLITERATION | LEMMA FIX NOT NEEDED | LEMMA FIXED BY USER | AJ FOREIGN PAGE |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | $a>on | $a>on\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 | شَأْن | nan | $An | $An\0\0\0\0\0\0\0\0\0 | 4 | 3 | 1 | shan | 0.00536164 | 0.00638311 | 0.00362253 | nan | 0 | sha'n! | test1,test2 | 1 | 1 | nan |
2 | $aAEir | $aAEir\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 | شَاعِر | nan | $Er | $Er\0\0\0\0\0\0\0\0\0 | 5 | 5 | 0 | sh??ir | 0.00670205 | 0.0106385 | 0 | nan | 0 | zZZ | nan | nan | 1 | nan |
3 | $aA^'a | $aA^'a\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 | شَآءَ | nan | $yA | $yA\0\0\0\0\0\0\0\0\0 | 236 | 160 | 76 | sh?a | 0.316337 | 0.340433 | 0.275312 | nan | 0 | nan | nan | nan | nan | nan |
4 | $aA^q~u | $aA^q~u\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 | شَآقُّ | nan | $qq | $qq\0\0\0\0\0\0\0\0\0 | 7 | 1 | 6 | sh?qqu | 0.00938287 | 0.0021277 | 0.0217352 | nan | 0 | nan | nan | nan | nan | nan |
5 | $aAhid | $aAhid\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 | شَاهِد | nan | $hd | $hd\0\0\0\0\0\0\0\0\0 | 21 | 14 | 7 | sh?hid | 0.0281486 | 0.0297879 | 0.0253577 | nan | 0 | nan | nan | nan | nan | nan |
6 | $aAkilat | $aAkilat\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 | شَاكِلَت | nan | $kl | $kl\0\0\0\0\0\0\0\0\0 | 1 | 1 | 0 | sh?kilat | 0.00134041 | 0.0021277 | 0 | nan | 0 | nan | nan | nan | 1 | nan |
7 | $aAkir | $aAkir\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 | شَاكِر | nan | $kr | $kr\0\0\0\0\0\0\0\0\0 | 14 | 10 | 4 | sh?kir | 0.0187657 | 0.021277 | 0.0144901 | nan | 0 | nan | nan | nan | 1 | nan |
8 | $aAni} | $aAni}\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 | شَانِئ | nan | $nA | $nA\0\0\0\0\0\0\0\0\0 | 1 | 1 | 0 | sh?ni | 0.00134041 | 0.0021277 | 0 | nan | 0 | nan | nan | nan | 1 | nan |
9 | $aAriko | $aAriko\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 | شَارِكْ | nan | $rk | $rk\0\0\0\0\0\0\0\0\0 | 1 | 1 | 0 | sh?rik | 0.00134041 | 0.0021277 | 0 | To associate something with God | 0 | nan | nan | nan | 1 | 185 |
10 | $aAwiro | $aAwiro\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 | شَاوِرْ | nan | $wr | $wr\0\0\0\0\0\0\0\0\0 | 1 | 0 | 1 | sh?wir | 0.00134041 | 0 | 0.00362253 | nan | 0 | nan | nan | nan | nan | nan |
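
In the rows above, the `ENGLISH-BINARY` and `ROOT-BINARY` values repeat the `ENGLISH` and `ROOT` values followed by `\0` (NUL) padding, which looks like fixed-width binary storage. The sketch below shows one way to strip or reproduce that padding in Python. The function names are hypothetical, and the width of 12 bytes for `ROOT-BINARY` is read off the sample rows (a three-character root plus nine NULs), not taken from a schema. The last helper illustrates a likely reason for keeping binary copies, namely that the transliteration is case-sensitive (`A` and `a` are different symbols); that motivation is an assumption.

```python
ROOT_BINARY_WIDTH = 12  # observed in the sample rows, not a documented value


def strip_nul_padding(raw: bytes) -> str:
    """Drop trailing NUL bytes from a fixed-width value and decode it."""
    return raw.rstrip(b"\x00").decode("ascii")


def pad_to_width(text: str, width: int) -> bytes:
    """NUL-pad `text` on the right to exactly `width` bytes."""
    encoded = text.encode("ascii")
    if len(encoded) > width:
        raise ValueError(f"{text!r} is longer than {width} bytes")
    return encoded.ljust(width, b"\x00")


def same_root(a: bytes, b: bytes) -> bool:
    """Byte-wise comparison, so '$An' and '$an' count as different roots."""
    return a.rstrip(b"\x00") == b.rstrip(b"\x00")


# Round trip for the root of sample row 1.
padded = pad_to_width("$An", ROOT_BINARY_WIDTH)
restored = strip_nul_padding(padded)
```

When the table is exported to CSV or loaded into a data frame, stripping the padding first avoids near-duplicate values that differ only in trailing NULs.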