Bantu Languages Community

Permanent URI for this communityhttps://dspace.fiti.info/handle/123456789/1

The Bantu Languages Community is the largest language family represented in the Kenyan Languages Archive. Bantu languages belong to the vast Niger-Congo phylum and are spoken by the majority of Kenya's population, from the central highlands to the coastal strip and lake region. This community houses documentary and archival materials for all Bantu languages indigenous to Kenya, organised by individual language sub-community and, within each, by dialect collection.

Bantu languages in Kenya are characterised by a noun-class system, agglutinative morphology, and a rich oral literary tradition spanning epic narratives, riddles, proverbs, and song. Several Bantu languages — including Kiswahili, Gikuyu, and Kikamba — have substantial written traditions and standardised orthographies, while others such as Rabai and Kauma of the Mijikenda cluster are critically endangered with very few remaining speakers.

Browse

Communities in this Community

Now showing 1 - 8 of 8
  • Archive for Ekegusii, spoken in Kisii and Nyamira counties. Dialect collections for the two attested varieties: Rogoro and Maate , which differ primarily in vocabulary and the articulation of /t/. ISO 639-3: guz. Glottolog: gusi1247.
  • Covers regional dialects of Nyeri, Murang'a, Kiambu, and Kirinyaga counties, including oral poetry, folk narrative, and linguistic descriptive materials. ISO 639-3: kik. Glottolog: kiku1240.
  • Documentary archive for Kikamba, spoken across Kitui, Machakos, Makueni and Nairobi counties. Sub-collections by dialect region. ISO 639-3: kam. Glottolog: kamb1297.
  • Archive for Kĩmîîrû and related Meru-group varieties: Imenti, Tigania, Igembe, Chuka, and Tharaka. ISO 639-3: mer (Meru); thk (Tharaka). Glottolog: meru1245.
  • Documentary archive for native coastal Kiswahili varieties spoken in Kenya, including Kimvita (Mombasa), Bajuni/Tikuu (Lamu Archipelago and Bajuni Islands), Chifundi (Wasini Island, Kwale coast), and Kiamu (Lamu literary dialect). The Kenyan coastal Swahili dialects including Kimvita, Bajuni/Tikuu, and Chifundi, are distinct from standard Swahili, which is largely based on the Kiunguja dialect of Zanzibar. These indigenous dialects, along with others like Kiamu, are considered highly endangered due to the dominance of standard Swahili and English. Note: Bajuni/Tikuu is sometimes classified as a distinct Bantu language rather than a Swahili dialect. ISO 639-3: swh. Glottolog: swah1253.
  • Archive for the Taita language cluster (Northeast Bantu, Niger-Congo) of Taita-Taveta County, south-eastern Kenya. Encompasses Kidaw'ida / Daw'ida (ISO 639-3: dav), Kisaghala / Saghala (ISO 639-3: tga), and Kikasigau / Kasigau. Kidaw'ida is the largest variety and the principal target of NLP and corpus development initiatives. Saghala is linguistically distinct enough to be considered a separate language. Collects recordings, texts, oral literature, and lexicographic materials across all Taita varieties; provenance recorded at item level.
  • <p> Luhya is not a single uniform language but a cluster of closely related dialects forming a dialect continuum. Major varieties include Lubukusu, Lulogooli (Maragoli), Luwanga, Lusamia, Lwisukha, Lwidakho, and others, spoken across the counties of Kakamega, Bungoma, Vihiga, Busia, and parts of Trans-Nzoia. The language cluster carries the ISO 639-3 macrolanguage code <strong>luy</strong> and falls within the JE.32 Guthrie classification for Bantu languages. </p>
  • Archive for the nine Mijikenda language varieties: Giriama, Digo, Duruma, Rabai, Kauma, Ribe, Kambe, Chonyi, and Jibana. Includes kaya oral traditions and ritual language. ISO 639-3: nyf (Giriama); dig (Digo). Glottolog: miji1238.

Materials in this community are subject to access terms specified at item level. Most items are released under CC BY 4.0 or CC BY-NC 4.0. Items involving sacred oral traditions or community-restricted knowledge are deposited under a Community Consent Agreement. For enquiries and to learn more, reach out to respective dataset/artefact issuer