维基词典:语系

维基词典,自由的多语言词典
本页或以下篇幅是一项有关维基词典的方针、指引或流程的提议。该提议可能仍需要进一步的完善,需要更多讨论并进一步获得社群的共识来确立。请勿在引用或者链接中把本页的描述作为一项正式的方针或指引。

维基词典将语言分为不同的家族。大多数语系是借由共同祖先而联系在一起的,但也有少数语系仅仅是普通分类,如“克里奥尔语和皮钦语”。维基词典按语系组织语言分类派生类别。每个语系在维基词典上都有一个名称和一个代码。

关于语言和方言的讨论,请分别参见Wiktionary:语言Wiktionary:方言

关于所有语系的代码列表,请参阅Wiktionary:语系列表

语系代码[编辑]

Wiktionary represents families by codes. Family codes are stored in Module:families along with related data. These codes are generally three letters, or three letters followed by a hyphen followed by another three letters. Exceptionally, they may be other strings. Family codes can reasonably be used in etymology templates like {{der}}, {{bor}} and {{inh}} and of course in the namespace dedicated to reconstructions.

Genetic families[编辑]

Genetic families are groups of languages which have a common ancestry. Wiktionary follows prevailing scholarship when grouping languages into genetic categories. Each genetic family has a code.

Many genetic families have an ISO 639-5 code. When one is available, it is used on Wiktionary as well. For example, the Austroasiatic languages are aav, the Celtic languages are cel, the Germanic languages are gem.

When a family has no standard ISO-639-5 code, but one of its superfamilies does have a code, Wiktionary assigns it a two part exceptional code. The first part is the ISO-639-5 code of its nearest superfamily, and the second part is a series of three lowercase letters which generally approximate the name of the family. For example, the Pama-Nyungan family is aus-pam: "aus" is the ISO 639-5 code for Australian languages, "pam" abbreviates "Pama-Nyungan". The Brythonic languages are cel-bry; Jewish Aramaic languages are sem-jar; South Bird's Head languages are ngf-sbh.

When neither a family nor any of its superfamilies has an ISO 639-5 code, the special code "qfa" is used as the first part, with the "q..." range being allowed by the ISO for private use and "fa" standing for "family". (This system was devised on IRC and in the BP.) For example, the Misumalpan languages are qfa-min.

Non-genetic categories and isolates[编辑]

Not all language groupings are genetic. Some groups contain languages with other common properties, or languages spoken in a certain area. The following are recognised:

  • art = Artificial/constructed languages, which are purposely created by linguists or hobbyists. Some may be widely used (such as Esperanto), but most are only limited to small communites (like Láadan).
  • crp = Creole or pidgin languages, which developed as a means of communication between groups that had no common language. For example, Krio, Zamboanga Chavacano, Greenlandic Eskimo Pidgin.
  • sgn = Sign languages (such as ASL), which are not spoken but communicated through gesturing.
  • qfa-mix = Mixed languages, which formed by roughly equal mixing of two or more languages, usually by speakers that spoke the source languages fluently.
  • qfa-iso = Language isolates, which have no demonstrated relationship to any other language (such as Ainu).
  • qfa-und = Languages with an unknown or undetermined affiliation. These differ from language isolates in that there is no linguistic consensus or maybe even no research at all on the further relationships of the languages. This code is used by most languages that have not yet had a family assigned to them by Wiktionary editors. This code is also used for language families which are not part of a larger family.
  • qfa-not = Languages that do not belong to a family in principle, because they are not true languages. "Translingual" (mul), "Undetermined" (und) and "substrate" (qfa-sub) are not languages, and cannot be considered members of any family. This code is also used for groupings of languages which are treated on Wiktionary as families, but which are not related. The preceding special family codes themselves also use this code.

语系名称[编辑]

  • 所有的语系名称在各自的词条中都应被定义为中文单词。
  • 每一语系只使用一个名称(用于词源、分类和其他方面)。如果有两个或两个以上的名称,则选择其中一个。
  • 为明确起见,应尽可能避免语系和语言使用相同的名称。
  • 语系的其他名称应在词条的“同义词”或“替代形式”章节中列出。

参见[编辑]