Table Quran Data - warwickfoster/qurantools GitHub Wiki

The quran-data table is the foundational dataset for the Qur'an text: it holds the original Arabic together with metadata such as sura (chapter) and verse numbering and the position of each word, for both analysis and display. Each entry corresponds to a single segment of a word (a prefix, stem, or suffix), so a verse is represented by several consecutive records.

Analysis of the quran-data Table

The quran-data table provides granular information about each word in the Qur'an, broken down into linguistic, grammatical, and semantic categories. Each field is described below.


Table Name Field Name Description
quran-data RECORD NUMBER A unique identifier for each record in the table, representing a single segment of a word within a verse.
quran-data LOCATION The hierarchical location of the word in the format (Sura:Verse:Word:Segment).
quran-data SURA-VERSE The reference to the Sura and Verse in which the word is located.
quran-data SURA The chapter (Sura) number of the Qur'an where the word appears.
quran-data VERSE The verse number within the specified Sura.
quran-data WORD The position of the word within the verse.
quran-data SEGMENT The segment number within the word, useful for compound or multi-part words.
quran-data GLOBAL WORD NUMBER A unique global identifier for each word in the Qur'an, across all Suras and verses.
quran-data CHANGES AFFECTING WORD The number of manuscript changes or corrections affecting the word.
quran-data FORM The form of the individual segment in Latin-script encoding (e.g., wa, HuS~ila), as distinct from TRANSLITERATED, which gives the whole word.
quran-data TRANSLITERATED A phonetic transliteration of the word into Latin script.
quran-data RENDERED ARABIC The word in its rendered Arabic script form.
quran-data TAG A short tag describing the word's role (e.g., CONJ for conjunction).
quran-data TAG EXPLAINED A detailed explanation of the tag, providing context about the word's grammatical role.
quran-data FEATURES Additional linguistic features, such as case, mood, or part of speech, concatenated as a string.
quran-data PART OF WORD Indicates whether the segment is a prefix, stem, or suffix of the word.
quran-data CASES The grammatical case of the word (e.g., nominative, accusative, genitive).
quran-data GENDER The gender of the word (e.g., masculine or feminine).
quran-data PERSON The grammatical person of the word (e.g., first, second, or third person).
quran-data NUMBER Indicates whether the word is singular, dual, or plural.
quran-data ROUGH TRANSLATION A rough English translation of the word.
quran-data ROOT The root letters of the word in Arabic.
quran-data LEMMA The base form (lemma) of the word in Arabic.
quran-data PARSED FLAG Indicates whether the word has been parsed and tagged for grammatical analysis.
quran-data QTL-PART-OF-WORD Specifies if the word is part of a Quranic textual lemma.
quran-data QTL-TAG-EXPLAINED Explains the tag in Quranic Text Lemma (QTL) terms.
quran-data QTL-LEMMA The Quranic Text Lemma form of the word.
quran-data QTL-LEMMA-BINARY Binary-encoded representation of the Quranic Text Lemma.
quran-data QTL-LEMMA-TRANSLITERATED Transliterated form of the Quranic Text Lemma.
quran-data QTL-ROOT The Quranic Text Root of the word.
quran-data QTL-ROOT-BINARY Binary-encoded representation of the Quranic Text Root.
quran-data QTL-ROOT-TRANSLITERATED Transliterated form of the Quranic Text Root.
quran-data QTL-SPECIAL-WORD-GROUP Indicates if the word belongs to a specific Quranic word group.
quran-data QTL-GENDER The Quranic Text Lemma gender of the word.
quran-data QTL-CASE The grammatical case for Quranic Text Lemma.
quran-data QTL-NUMBER The grammatical number for Quranic Text Lemma.
quran-data QTL-PERSON The grammatical person for Quranic Text Lemma.
quran-data QTL-VOICE Indicates active or passive voice in Quranic Text Lemma.
quran-data QTL-DERIVED-NOUN-TYPE Specifies if the noun is derived and its type in Quranic Text Lemma.
quran-data QTL-ARABIC-FORM The Arabic form of the Quranic Text Lemma.
quran-data QTL-NOUN-STATE Specifies the state of the noun (e.g., definite or indefinite).
quran-data QTL-MOOD The grammatical mood of the word (e.g., indicative, subjunctive).
quran-data ROOT OR PARTICLE Indicates whether the word is a root or particle.
quran-data GLOSS A detailed gloss or semantic explanation of the word.
quran-data FIXED_ARABIC Corrected Arabic text for the word, if applicable.
quran-data FIXED_TRANSLITERATION Corrected transliteration for the word, if applicable.
quran-data FIXED_GLOSS Corrected gloss for the word, if applicable.
quran-data FIXED_DB_APPROVED Indicates if the fixed entries have been approved for database use.
quran-data PREVIOUS ARABIC The Arabic text before any corrections were applied.
quran-data PREVIOUS TRANSLITERATION The transliteration before any corrections were applied.
quran-data TRANSLITERATION FIX APPLIED Indicates whether a transliteration fix was applied.
quran-data FIXED_NOTES Notes explaining the corrections made.
quran-data FIX_CREATED_BY The user or system responsible for creating the fix.
quran-data FIX_APPROVED_BY The user or system responsible for approving the fix.
quran-data FIX_CHANGES_SUGGESTED Indicates if changes were suggested for review.
quran-data AJC FOREIGN WORD Indicates foreign or borrowed words within the Quranic text.
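
The LOCATION format described above, (Sura:Verse:Word:Segment), can be split into its four numbers with a small helper. The following is a minimal sketch: the `Location` type and the `parse_location` function are illustrative names chosen here, not part of the qurantools code base.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for the four parts of a LOCATION value."""
    sura: int
    verse: int
    word: int
    segment: int


# Matches strings such as "(100:10:1:2)".
_LOCATION_RE = re.compile(r"^\((\d+):(\d+):(\d+):(\d+)\)$")


def parse_location(value: str) -> Location:
    """Split a LOCATION string into sura, verse, word and segment numbers."""
    match = _LOCATION_RE.match(value.strip())
    if match is None:
        raise ValueError(f"not a (Sura:Verse:Word:Segment) location: {value!r}")
    return Location(*(int(part) for part in match.groups()))


loc = parse_location("(100:10:1:2)")
print(loc.sura, loc.verse, loc.word, loc.segment)
# The SURA-VERSE style reference can be rebuilt from the first two parts.
print(f"{loc.sura}:{loc.verse}")
```

Returning a named tuple keeps the four numbers comparable and sortable, which is convenient when ordering records by their position in the text.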

Key Insights

  1. Comprehensive Linguistic Analysis:

    • The quran-data table serves as a foundational resource for linguistic studies, capturing granular details about each word's grammatical, phonetic, and semantic properties.
    • Fields such as ROOT, LEMMA, and PARSED FLAG support morphological and syntactic parsing.
  2. Advanced Parsing Features:

    • The inclusion of QTL-* fields offers specialized metadata for Quranic Text Lemmas, emphasizing the contextual use of words in the Qur'anic corpus.
    • Fields like TRANSLITERATION FIX APPLIED and FIXED_NOTES record which corrections were applied and why, keeping changes traceable.
  3. Interdisciplinary Utility:

    • This table supports multiple disciplines, including computational linguistics, textual criticism, and comparative theology.
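
As one illustration of the morphological use described above, records can be indexed by root to build a simple concordance. This is a sketch under assumptions: the rows are hand-trimmed dictionaries holding only three fields, the choice of QTL-ROOT as the grouping field is ours, and `index_by_root` is a hypothetical helper name.

```python
from collections import defaultdict

# Hand-trimmed records for illustration; a real row has many more fields.
records = [
    {"LOCATION": "(100:10:1:2)", "TAG": "V", "QTL-ROOT": "HSl"},
    {"LOCATION": "(100:10:3:1)", "TAG": "P", "QTL-ROOT": None},
    {"LOCATION": "(100:10:4:2)", "TAG": "N", "QTL-ROOT": "Sdr"},
    {"LOCATION": "(100:11:2:1)", "TAG": "N", "QTL-ROOT": "rbb"},
]


def index_by_root(rows, root_field="QTL-ROOT"):
    """Map each root to the list of locations where it occurs."""
    index = defaultdict(list)
    for row in rows:
        root = row.get(root_field)
        if root:  # particles and affixes carry no root and are skipped
            index[root].append(row["LOCATION"])
    return dict(index)


concordance = index_by_root(records)
print(concordance["HSl"])
```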

Example Interpretation of Data

Row Example:

FIELD VALUE
RECORD NUMBER 127719
LOCATION (100:10:1:2)
SURA-VERSE 100:10
TRANSLITERATED wa-ḥuṣṣila
RENDERED ARABIC وَحُصِّلَ
TAG V
TAG EXPLAINED Verb
ROOT HSl
ROUGH TRANSLATION And is made apparent

Interpretation:

This record represents the verb stem (TAG: V) of the first word of Sura 100, Verse 10; it is segment 2 of that word, following the conjunction prefix wa- in segment 1. The whole word is transliterated as wa-ḥuṣṣila and rendered in Arabic as وَحُصِّلَ, and the verb derives from the root HSl, which carries the sense of something being "gathered" or "made apparent." The verb is a third-person masculine singular perfect in the passive voice (in the sample rows, QTL-PERSON: 3 and QTL-VOICE: Passive) and carries significant theological implications in the context of divine judgment.
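
Because each record is one segment, reading a whole word means regrouping the segments that share a GLOBAL WORD NUMBER. The sketch below assumes rows have already been loaded as dictionaries keyed by field name; `words_from_segments` is a hypothetical helper, and the sample values are taken from the rows for Sura 100, Verse 10.

```python
from itertools import groupby

# Trimmed segment records; only the fields needed for regrouping are kept.
segments = [
    {"GLOBAL WORD NUMBER": 77109, "SEGMENT": 2, "FORM": "HuS~ila", "TAG": "V"},
    {"GLOBAL WORD NUMBER": 77109, "SEGMENT": 1, "FORM": "wa", "TAG": "CONJ"},
    {"GLOBAL WORD NUMBER": 77110, "SEGMENT": 1, "FORM": "maA", "TAG": "REL"},
    {"GLOBAL WORD NUMBER": 77112, "SEGMENT": 1, "FORM": "{l", "TAG": "DET"},
    {"GLOBAL WORD NUMBER": 77112, "SEGMENT": 2, "FORM": "S~uduwri", "TAG": "N"},
]


def words_from_segments(rows):
    """Join the segments of each word, in segment order."""
    ordered = sorted(rows, key=lambda r: (r["GLOBAL WORD NUMBER"], r["SEGMENT"]))
    words = []
    for number, group in groupby(ordered, key=lambda r: r["GLOBAL WORD NUMBER"]):
        parts = list(group)
        words.append({
            "global_word_number": number,
            "form": "".join(p["FORM"] for p in parts),
            "tags": [p["TAG"] for p in parts],
        })
    return words


for word in words_from_segments(segments):
    print(word["global_word_number"], word["form"], word["tags"])
```

Sorting before `groupby` matters, since `groupby` only merges adjacent items with the same key.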


Contextual Significance

  1. Textual Criticism:

    • Fields like CHANGES AFFECTING WORD and PREVIOUS ARABIC document manuscript variations, crucial for analyzing the textual history of the Qur'an.
  2. Linguistic Semantics:

    • The table facilitates semantic analysis of Qur'anic language by linking words to their roots, lemmas, and glosses.
  3. Thematic Analysis:

    • By examining FORMULA-* fields, researchers can identify recurring themes and formulaic structures, shedding light on the Qur'an's rhetorical strategies.
  4. Cross-Linguistic Comparisons:

    • Transliteration and transliteration fixes allow for accurate phonetic representations, enabling comparative studies between Arabic and other languages.
  5. Machine Learning and NLP:

    • The structured data can train models for natural language processing tasks, such as part-of-speech tagging, lemmatization, or translation.
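
For the NLP use case above, one plausible preparation step is to turn the table into per-verse sequences of (form, tag) pairs, the usual input shape for a part-of-speech tagger. This is a sketch only: `tagging_examples` is a hypothetical helper, and ordering by RECORD NUMBER is an assumption based on the sample rows, where record numbers follow text order.

```python
from collections import defaultdict

# Trimmed rows for illustration; values follow the sample for Sura 100.
rows = [
    {"RECORD NUMBER": 127725, "SURA-VERSE": "100:11", "FORM": "rab~a", "TAG": "N"},
    {"RECORD NUMBER": 127724, "SURA-VERSE": "100:11", "FORM": "<in~a", "TAG": "ACC"},
    {"RECORD NUMBER": 127721, "SURA-VERSE": "100:10", "FORM": "fiY", "TAG": "P"},
    {"RECORD NUMBER": 127726, "SURA-VERSE": "100:11", "FORM": "hum", "TAG": "PRON"},
]


def tagging_examples(records):
    """Return {sura-verse: [(form, tag), ...]} with segments in record order."""
    by_verse = defaultdict(list)
    for row in sorted(records, key=lambda r: r["RECORD NUMBER"]):
        by_verse[row["SURA-VERSE"]].append((row["FORM"], row["TAG"]))
    return dict(by_verse)


examples = tagging_examples(rows)
print(examples["100:11"])
```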

First 10 Rows Example

RECORD NUMBER LOCATION SURA-VERSE SURA VERSE WORD SEGMENT GLOBAL WORD NUMBER CHANGES AFFECTING WORD FORM TRANSLITERATED RENDERED ARABIC TAG TAG EXPLAINED FEATURES PART OF WORD CASES GENDER PERSON NUMBER Rough Translation ROOT LEMMA Parsed Flag QTL-PART-OF-WORD QTL-TAG-EXPLAINED QTL-LEMMA QTL-LEMMA-BINARY QTL-LEMMA-TRANSLITERATED QTL-ROOT QTL-ROOT-BINARY QTL-ROOT-TRANSLITERATED QTL-SPECIAL-WORD-GROUP QTL-GENDER QTL-CASE QTL-NUMBER QTL-PERSON QTL-VOICE QTL-DERIVED-NOUN-TYPE QTL-ARABIC-FORM QTL-NOUN-STATE QTL-MOOD FORMULA-2-ROOT FORMULA-3-ROOT F3-ROOT-QTL-ROOT-FLAG FORMULA-4-ROOT FORMULA-5-ROOT FORMULA-3-ROOT-ALL FORMULA-4-ROOT-ALL FORMULA-5-ROOT-ALL FORMULA-3-LEMMA FORMULA-4-LEMMA FORMULA-5-LEMMA FORMULA-3-ANY FORMULA-4-ANY FORMULA-5-ANY ROOT OR PARTICLE GLOSS FIXED_ARABIC FIXED_TRANSLITERATION FIXED_GLOSS FIXED_DB_APPROVED PREVIOUS ARABIC PREVIOUS TRANSLITERATION TRANSLITERATION FIX APPLIED FIXED_NOTES FIX_CREATED_BY FIX_APPROVED_BY FIX_CHANGES_SUGGESTED AJC FOREIGN WORD
127718 (100:10:1:1) 100:10 100 10 1 1 77109 0 wa wa-ḥuṣṣila وَحُصِّلَ CONJ Coordinating Conjunction PREFIX w:CONJ+ Prefix nan nan nan nan test nan nan 8 Prefix Coordinating Conjunction (wa) nan \0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 nan nan \0\0\0\0\0\0\0\0\0\0 nan nan nan nan nan 0 nan nan nan nan nan 0 0 0 0 0 0 0 0 0 0 0 0 0 0 nan And is made apparent nan nan nan Y nan nan N nan 0 1 nan
127719 (100:10:1:2) 100:10 100 10 1 2 77109 0 HuS~ila wa-ḥuṣṣila وَحُصِّلَ V Verb STEM POS:V PERF PASS (II) LEM:HuS~ila ROOT:HSl 3MS Stem nan nan nan nan nan nan nan 8 Stem Perfect Verb HuS~ila HuS~ila\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 ḥuṣ~ila HSl HSl\0\0\0\0\0\0\0 ḥṣl nan Masculine nan Singular 3 Passive nan II nan nan 0 0 0 0 0 0 0 0 0 0 0 0 0 0 HSl And is made apparent nan nan nan Y nan
127720 (100:10:2:1) 100:10 100 10 2 1 77110 0 maA mā مَا REL Relative Pronoun STEM POS:REL LEM:maA Stem nan nan nan nan nan nan nan 8 Stem Relative Pronoun maA maA\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 ma' nan \0\0\0\0\0\0\0\0\0\0 nan nan nan nan nan 0 nan nan nan nan nan 0 0 0 0 0 0 0 0 1 0 0 1 0 0 PORP (as) nan nan nan Y nan nan N nan 0 1
127721 (100:10:3:1) 100:10 100 10 3 1 77111 0 fiY fī فِى P Particle-Preposition STEM POS:P LEM:fiY Stem nan nan nan nan nan nan nan 8 Stem Particle-Preposition fiY fiY\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 fiY nan \0\0\0\0\0\0\0\0\0\0 nan nan nan nan nan 0 nan nan nan nan nan 0 0 0 0 0 0 0 0 1 0 0 1 0 0 PORP in nan nan nan Y nan nan N nan 0 1
127722 (100:10:4:1) 100:10 100 10 4 1 77112 0 {l l-ṣudūri ٱلصُّدُورِ DET Definite Article PREFIX Al+ Prefix nan nan nan nan nan nan nan 8 Prefix Definite Article nan \0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 nan nan \0\0\0\0\0\0\0\0\0\0 nan nan nan nan nan 0 nan nan nan nan nan 0 0 0 0 0 0 0 0 1 0 0 1 0 0 nan of what is in the breasts nan nan nan Y nan nan N nan 0 1 nan
127723 (100:10:4:2) 100:10 100 10 4 2 77112 0 S~uduwri l-ṣudūri ٱلصُّدُورِ N Noun STEM POS:N LEM:Sador ROOT:Sdr MP GEN Stem Genitive nan nan nan nan nan nan 8 Stem Noun Sador Sador\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 ṣador Sdr Sdr\0\0\0\0\0\0\0 ṣdr nan Masculine Genitive Plural 0 nan nan nan nan nan 0 0 0 0 0 0 0 0 1 0 0 1 0 0 Sdr of what is in the breasts nan nan nan Y nan nan N
127724 (100:11:1:1) 100:11 100 11 1 1 77113 0 <in~a inna إِنَّ ACC Accusative Particle STEM POS:ACC LEM:<in~ SP:<in~ Stem Accusative nan nan nan nan nan nan 8 Stem Accusative Particle <in~ <in~\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 <in~ nan \0\0\0\0\0\0\0\0\0\0 nan <in~ nan nan nan 0 nan nan nan nan nan 0 0 0 0 0 0 0 0 0 0 0 0 0 0 PORP indeed nan nan nan Y nan 'inna N nan 0
127725 (100:11:2:1) 100:11 100 11 2 1 77114 0 rab~a rabbahum رَبَّهُم N Noun STEM POS:N LEM:rab~ ROOT:rbb M ACC Stem Accusative nan nan nan nan nan nan 8 Stem Noun rab~ rab~\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 rab~ rbb rbb\0\0\0\0\0\0\0 rbb nan Masculine Accusative Singular 0 nan nan nan nan nan 0 0 0 0 0 0 0 0 0 0 0 0 0 0 rbb their Lord nan nan nan Y nan nan N
127726 (100:11:2:2) 100:11 100 11 2 2 77114 0 hum rabbahum رَبَّهُم PRON Personal Pronoun SUFFIX PRON:3MP Suffix nan nan nan nan nan nan nan 8 Suffix Personal Pronoun nan \0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 nan nan \0\0\0\0\0\0\0\0\0\0 nan nan Masculine nan Plural 3 nan nan nan nan nan 0 0 0 0 0 0 0 0 0 0 0 0 0 0 nan their Lord nan nan nan Y nan nan N nan 0 1 nan
127727 (100:11:3:1) 100:11 100 11 3 1 77115 0 bi bihim بِهِمْ P Particle-Preposition PREFIX bi+ Prefix nan nan nan nan nan nan nan 8 Prefix Particle-Preposition nan \0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0 nan nan \0\0\0\0\0\0\0\0\0\0 nan nan nan nan nan 0 nan nan nan nan nan 0 0 0 0 0 0 0 0 0 0 0 0 0 0 PORP them nan nan nan Y nan nan N nan 0 1 nan
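
The sample rows show two kinds of placeholder that usually need cleaning before analysis: empty cells exported as nan, and the *-BINARY fields padded with \0 markers. The sketch below is one way to normalise such values. It assumes the padding may arrive either as the literal two-character text \0 or as real NUL bytes, and `clean_value` is a hypothetical helper name.

```python
def clean_value(raw: str):
    """Strip NUL padding and map empty or 'nan' cells to None."""
    # Remove both the literal text "\0" and actual NUL characters,
    # since it is an assumption which of the two an export contains.
    text = raw.replace("\\0", "").replace("\x00", "").strip()
    if text == "" or text.lower() == "nan":
        return None
    return text


print(clean_value("HSl" + "\\0" * 7))   # padded QTL-ROOT-BINARY style value
print(clean_value("\\0" * 10))          # padding only, no root
print(clean_value("nan"))               # empty cell
```

Mapping both cases to `None` lets later code test for a missing root or lemma with a single check.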