Module:families/data/documentation
< Module:families | data
This module contains definitions for all language family codes on Wiktionary. The entries are listed by their family code, which is usually an ISO-639 code, but there are also some exceptional codes. These are listed on Wiktionary:Families; that page also explains how to create new exceptional codes.
This module must not be imported using require
. Instead, it is imported like this:
local families = mw.loadData("Module:families")
This ensures that the data is only loaded once per page, rather than once for every module invocation like normal.
Errors
සංස්කරණයThe following errors were detected by Module:data consistency check:
- Norwegian Bokmål (
nb
) has Middle Norwegian (gmq-mno
) set as an ancestor, but is not in the West Scandinavian භාෂා (gmq-wes
). - Norwegian Bokmål (
nb
) has ඩෙන්මාර්ක (da
) set as an ancestor, but is not in the East Scandinavian භාෂා (gmq-eas
).
- Caribbean Hindustani (
hns
) has Bhojpuri (bho
) set as an ancestor, but is not in the Bihari භාෂා (inc-bih
). - Caribbean Hindustani (
hns
) has Awadhi (awa
) set as an ancestor, but is not in the Eastern Hindi භාෂා (inc-hie
).
- Proto-Central Togo (
alv-gtm-pro
) does not have the expected name "Proto-Ghana-Togo Mountain", even though it is the proto-language of the Ghana-Togo Mountain භාෂා (alv-gtm
). - Proto-Arawa (
auf-pro
) does not have the expected name "Proto-Arauan", even though it is the proto-language of the Arauan භාෂා (auf
). - Proto-Amuesha-Chamicuro (
awd-amc-pro
) has a proto-language code associated with the invalid code"awd-amc"
. - Proto-Kampa (
awd-kmp-pro
) has a proto-language code associated with the invalid code"awd-kmp"
. - Proto-Arawak (
awd-pro
) does not have the expected name "Proto-Arawakan", even though it is the proto-language of the Arawakan භාෂා (awd
). - Proto-Paresi-Waura (
awd-prw-pro
) has a proto-language code associated with the invalid code"awd-prw"
. - Proto-Ta-Arawak (
awd-taa-pro
) does not have the expected name "Proto-Ta-Arawakan", even though it is the proto-language of the Ta-Arawakan භාෂා (awd-taa
). - Proto-Rukai (
dru-pro
) has a proto-language code associated with Rukai (dru
), which is not a family. - Proto-Basque (
euq-pro
) does not have the expected name "Proto-Vasconic", even though it is the proto-language of the Vasconic භාෂා (euq
). - ප්රොටෝ-ජර්මානු (
gem-pro
) does not have the expected name "Proto-ජර්මානු", even though it is the proto-language of the ජර්මානු භාෂා (gem
). - Proto-Norse (
gmq-pro
) does not have the expected name "Proto-North Germanic", even though it is the proto-language of the North Germanic භාෂා (gmq
). - ප්රොටෝ-බටහිර ජර්මානු (
gmw-pro
) does not have the expected name "Proto-බටහිර ජර්මානු", even though it is the proto-language of the බටහිර ජර්මානු භාෂා (gmw
). - Proto-Kamta (
inc-krn-pro
) does not have the expected name "Proto-KRNB lects", even though it is the proto-language of the KRNB lects (inc-krn
). - Proto-Indo-Aryan (
inc-pro
) does not have the expected name "Proto-ඉන්දු-ආර්ය", even though it is the proto-language of the ඉන්දු-ආර්ය භාෂා (inc
). - ප්රොටෝ-ඉන්දු-යුරෝපීය (
ine-pro
) does not have the expected name "Proto-ඉන්දු-යුරෝපීය", even though it is the proto-language of the ඉන්දු-යුරෝපීය භාෂා (ine
). - Kelantan Peranakan Hokkien (
mis-hkl
) has its canonical name ("Kelantan Peranakan Hokkien"
) repeated in the table ofaliases
. - Proto-Chumash (
nai-chu-pro
) does not have the expected name "Proto-Chumashan", even though it is the proto-language of the Chumashan භාෂා (nai-chu
). - Proto-Maidun (
nai-mdu-pro
) does not have the expected name "Proto-Maiduan", even though it is the proto-language of the Maiduan භාෂා (nai-mdu
). - Proto-Mixe-Zoque (
nai-miz-pro
) does not have the expected name "Proto-Mixe-Zoquean", even though it is the proto-language of the Mixe-Zoquean භාෂා (nai-miz
). - Proto-Pomo (
nai-pom-pro
) does not have the expected name "Proto-Pomoan", even though it is the proto-language of the Pomoan භාෂා (nai-pom
). - Proto-Mazatec (
omq-maz-pro
) does not have the expected name "Proto-Mazatecan", even though it is the proto-language of the Mazatecan භාෂා (omq-maz
). - Proto-Ossetic (
os-pro
) has a proto-language code associated with Ossetian (os
), which is not a family. - Proto-North Sarawak (
poz-swa-pro
) does not have the expected name "Proto-North Sarawakan", even though it is the proto-language of the North Sarawakan භාෂා (poz-swa
). - Proto-Salish (
sal-pro
) does not have the expected name "Proto-Salishan", even though it is the proto-language of the Salishan භාෂා (sal
). - Proto-Puroik (
sit-khp-pro
) has a proto-language code associated with the invalid code"sit-khp"
. - Proto-Samic (
smi-pro
) does not have the expected name "Proto-Sami", even though it is the proto-language of the Sami භාෂා (smi
). - Proto-Kuki-Chin (
tbq-kuk-pro
) does not have the expected name "Proto-Kukish", even though it is the proto-language of the Kukish භාෂා (tbq-kuk
). - Proto-Saka (
xsc-sak-pro
) does not have the expected name "Proto-Sakan", even though it is the proto-language of the Sakan භාෂා (xsc-sak
). - Proto-Sarmatian (
xsc-sar-pro
) has a proto-language code associated with the invalid code"xsc-sar"
.
- Literary Chinese (
lzh-lit
) has a canonical name that is not unique; it is also used by the codelzh
. - The data key
preprocess_links
for Hacked Thai (th-new
) is invalid.
- Old Indo-Aryan භාෂා (
inc-old
) has no child families or languages.
- Literary Chinese, the canonical name for the code
lzh-lit
, is wrong; it should be Literary Chinese.
- Literary Chinese, the canonical name for the code
lzh-lit
, is wrong; it should be Literary Chinese.
- The code
ira-mid
and the canonical name Middle Iranian should be removed; they are not found in Module:families/data. - The code
ira-old
and the canonical name Old Iranian should be removed; they are not found in Module:families/data.
- The code
ira-mid
and the canonical name Middle Iranian should be removed; they are not found in Module:families/data. - The code
ira-old
and the canonical name Old Iranian should be removed; they are not found in Module:families/data.
- Code:
abs
. Saw name: Ambonese මැලේ. Expected name: Ambonese Malay. - Code:
acw
. Saw name: Hijazi අරාබි. Expected name: Hijazi Arabic. - Code:
acy
. Saw name: Cypriot අරාබි. Expected name: Cypriot Arabic. - Code:
aeb
. Saw name: Tunisian අරාබි. Expected name: Tunisian Arabic. - Code:
afb
. Saw name: Gulf අරාබි. Expected name: Gulf Arabic. - Code:
ajp
. Saw name: South Levantine අරාබි. Expected name: South Levantine Arabic. - Code:
ang
. Saw name: Old ඉංග්රීසි. Expected name: පුරාතන ඉංග්රීසි. - Code:
apc
. Saw name: North Levantine අරාබි. Expected name: North Levantine Arabic. - Code:
ary
. Saw name: Moroccan අරාබි. Expected name: Moroccan Arabic. - Code:
arz
. Saw name: Egyptian අරාබි. Expected name: Egyptian Arabic. - Code:
ayl
. Saw name: Libyan අරාබි. Expected name: Libyan Arabic. - Code:
br
. Saw name: Breton. Expected name: බ්රෙටන්. - Code:
cmn-ear
. Saw name: Mandarin. Expected name: මැන්ඩරීන්. - Code:
cy
. Saw name: Welsh. Expected name: වේල්ස. - Code:
dra-okn
. Saw name: Old කන්නඩ. Expected name: Old Kannada. - Code:
dum
. Saw name: Middle ඕලන්ද. Expected name: Middle Dutch. - Code:
enm
. Saw name: Middle ඉංග්රීසි. Expected name: මධ්යකාලීන ඉංග්රීසි. - Code:
fr-CA
. Saw name: French. Expected name: ප්රංශ. - Code:
frk
. Saw name: Proto-West Germanic. Expected name: ප්රොටෝ-බටහිර ජර්මානු. - Code:
frm
. Saw name: Middle ප්රංශ. Expected name: මධ්යකාලීන ප්රංශ. - Code:
fro
. Saw name: Old ප්රංශ. Expected name: පුරාතන ප්රංශ. - Code:
fro-nor
. Saw name: Old French. Expected name: පුරාතන ප්රංශ. - Code:
gd
. Saw name: Scottish Gaelic. Expected name: ස්කොට්ස් ගේලික්. - Code:
gem
. Saw name: Germanic. Expected name: ජර්මානු. - Code:
gem-pro
. Saw name: Proto-Germanic. Expected name: ප්රොටෝ-ජර්මානු. - Code:
gkm
. Saw name: Ancient Greek. Expected name: පුරාතන ග්රීක. - Code:
gmh
. Saw name: Middle High ජර්මානු. Expected name: Middle High German. - Code:
gml
. Saw name: Middle Low ජර්මානු. Expected name: Middle Low German. - Code:
gmq-mno
. Saw name: Middle නෝර්වීජියානු. Expected name: Middle Norwegian. - Code:
gmq-oda
. Saw name: Old ඩෙන්මාර්ක. Expected name: Old Danish. - Code:
gmq-osw
. Saw name: Old ස්වීඩන්. Expected name: Old Swedish. - Code:
gmw-ecg
. Saw name: East Central ජර්මානු. Expected name: East Central German. - Code:
gmw-jdt
. Saw name: Jersey ඕලන්ද. Expected name: Jersey Dutch. - Code:
gmy
. Saw name: Mycenaean ග්රීක. Expected name: Mycenaean Greek. - Code:
goh
. Saw name: Old High ජර්මානු. Expected name: Old High German. - Code:
grk-mar
. Saw name: Mariupol ග්රීක. Expected name: Mariupol Greek. - Code:
gsw
. Saw name: Alemannic ජර්මානු. Expected name: Alemannic German. - Code:
gv
. Saw name: Manx. Expected name: මැන්ක්ස්. - Code:
idb
. Saw name: Indo-පෘතුගීසි. Expected name: Indo-Portuguese. - Code:
inc-ash
. Saw name: Ashokan ප්රාකෘත. Expected name: Ashokan Prakrit. - Code:
kaw
. Saw name: Old ජාවා. Expected name: Old Javanese. - Code:
kw
. Saw name: Cornish. Expected name: කෝනිෂ්. - Code:
kxd
. Saw name: Brunei මැලේ. Expected name: Brunei Malay. - Code:
ltc
. Saw name: Middle චීන. Expected name: Middle Chinese. - Code:
meo
. Saw name: Kedah මැලේ. Expected name: Kedah Malay. - Code:
mga
. Saw name: Middle අයිරිෂ්. Expected name: Middle Irish. - Code:
ms-cla
. Saw name: Malay. Expected name: මැලේ. - Code:
ms-old
. Saw name: Malay. Expected name: මැලේ. - Code:
nds
. Saw name: Low ජර්මානු. Expected name: Low German. - Code:
nds-de
. Saw name: German Low ජර්මානු. Expected name: German Low German. - Code:
nod
. Saw name: Northern තායි. Expected name: Northern Thai. - Code:
obr
. Saw name: Old බුරුම. Expected name: Old Burmese. - Code:
och
. Saw name: Old චීන. Expected name: Old Chinese. - Code:
odt
. Saw name: Old ඕලන්ද. Expected name: Old Dutch. - Code:
oge
. Saw name: Old ජෝර්ජියානු. Expected name: Old Georgian. - Code:
ohu
. Saw name: Old හංගේරියානු. Expected name: Old Hungarian. - Code:
ojp
. Saw name: Old ජපන්. Expected name: Old Japanese. - Code:
okm
. Saw name: Middle කොරියානු. Expected name: Middle Korean. - Code:
oko
. Saw name: Old කොරියානු. Expected name: Old Korean. - Code:
osp
. Saw name: Old ස්පාඤ්ඤ. Expected name: Old Spanish. - Code:
ota
. Saw name: Ottoman තුර්කි. Expected name: Ottoman Turkish. - Code:
pal
. Saw name: Middle පර්සියානු. Expected name: Middle Persian. - Code:
pdc
. Saw name: Pennsylvania ජර්මානු. Expected name: Pennsylvania German. - Code:
peo
. Saw name: Old පර්සියානු. Expected name: Old Persian. - Code:
rmg
. Saw name: Traveller නෝර්වීජියානු. Expected name: Traveller Norwegian. - Code:
roa-opt
. Saw name: Old Galician-පෘතුගීසි. Expected name: Old Galician-Portuguese. - Code:
ruo
. Saw name: Istro-රුමේනියානු. Expected name: Istro-Romanian. - Code:
ruq
. Saw name: Megleno-රුමේනියානු. Expected name: Megleno-Romanian. - Code:
sa-ved
. Saw name: Sanskrit. Expected name: සංස්කෘත. - Code:
sga
. Saw name: Old අයිරිෂ්. Expected name: Old Irish. - Code:
sit-pro
. Saw name: Proto-Sino-ටිබෙට්. Expected name: Proto-Sino-Tibetan. - Code:
sou
. Saw name: Southern තායි. Expected name: Southern Thai. - Code:
tbq-lob-pro
. Saw name: Proto-Lolo-බුරුම. Expected name: Proto-Lolo-Burmese. - Code:
trk-oat
. Saw name: Old Anatolian තුර්කි. Expected name: Old Anatolian Turkish. - Code:
xaa
. Saw name: Andalusian අරාබි. Expected name: Andalusian Arabic. - Code:
xcl
. Saw name: Old ආමේනියානු. Expected name: Old Armenian. - Code:
xno
. Saw name: Old French. Expected name: පුරාතන ප්රංශ. - Code:
zlw-ocs
. Saw name: Old චෙක්. Expected name: Old Czech. - Code:
zlw-opl
. Saw name: Old පෝලන්ත. Expected name: Old Polish.
Required values
සංස්කරණයEvery entry in the table must contain the following properties:
canonicalName
- The "canonical" name of the family. This is the name that is used in Wiktionary entries and category names.
Optional values
සංස්කරණයaliases
- A list of aliases/synonyms for the family, other than the canonical name.
varieties
- A table of family varieties that are subsumed under the family. This should not include those varieties for which separate family codes have been assigned. If a given variety has several names, they can all be listed by including a sublist in the overall list, where the first element is the canonical name that you want the variety to be known by, and the remainder are aliases. This facility is more commonly used for individual languages than families, but the facility is also provided for families. See Template:language extradata documentation for an example.
otherNames
(deprecated)- A table of all non-canonical names that this family is known by, including both synonyms and varieties. This should not be used in new families, and existing families should have the entries in this list moved into either
aliases
orvarieties
. protoLanguage
- The language code for the proto-language of the family; that is, the common ancestor of all languages in the family. If this is not specified, Module:families will attempt to create the proto-language code by adding
-pro
to the family code — for instance,ine
(Indo-European languages) →ine-pro
(Proto-Indo-European language) — and then use Module:languages to check if the code is valid. family
- The superfamily that the language family belongs to, if any.
wikidata_item
- The number in the Wikidata item id for the language family.