Data Format

The language data is comprised of languages as rows along with sounds as columns. The data is contains scores for the languages for each of the sounds.

For Uto-Aztecan, we have 141 sounds for 40 languages. For Bantu, we have 287 sounds for 105 languages. See the example below.

LanguageAB...NPSTW...\\=A\\=W\\=e\\=o\\=u\\={3}
Northern_Paiute0.1431492840...01.43E-02008.18E-02...1.02E-026.13E-0306.13E-034.09E-030
Western_Mono0.1052631580...02.53E-02006.24E-02...3.90E-033.90E-0303.90E-0300
Tumpisa_Shoshone_Panamint_Koso0.1107266440...1.56E-021.56E-02009.52E-02...3.46E-031.73E-0303.46E-033.46E-030
Big_Smokey_Valley_Shoshone0.1339449540...07.34E-03000.100917431...5.50E-031.83E-0305.50E-035.50E-030
Western_Shoshone0.1140684410...01.71E-02009.32E-02...3.80E-031.90E-0305.70E-033.80E-030
Shoshone0.1252446180...07.83E-03000.105675147...5.87E-031.96E-0305.87E-033.91E-030
................................................
Pochutla_Mexicano7.78E-020...002.99E-0200...000000
Pipil0.1085594990...001.88E-0200...3.13E-0201.46E-0201.25E-020
Ipai5.87E-020...2.02E-036.07E-028.10E-0302.83E-02...5.67E-022.63E-022.02E-03000
Tewa9.23E-036.15E-03...1.23E-024.31E-026.15E-033.08E-030...3.08E-0306.15E-039.23E-033.08E-030
Zuni0.1381692570...1.73E-033.80E-022.76E-0200...3.45E-0303.45E-031.73E-0300
Kawaiisu0.1040650410...03.09E-023.25E-0308.29E-02...1.46E-024.88E-036.50E-033.25E-038.13E-030