Supported LanguagesΒΆ

The ChatterBot Corpus includes conversation data for the following 26 languages. Each language directory contains one or more YAML files covering different conversation topics (such as greetings, food, humor, etc.).

  • Bengali

  • Chinese

  • Dutch

  • English

  • French

  • German

  • Hebrew

  • Hindi

  • Hinglish

  • Indonesian

  • Italian

  • Japanese

  • Korean

  • Marathi

  • Oriya

  • Persian

  • Portuguese

  • Russian

  • Spanish

  • Swedish

  • Tamil

  • Telugu

  • Thai

  • Traditional Chinese

  • Turkish

  • Ukrainian

  • Urdu

  • Yoruba

Note

Coverage varies by language. Some languages (such as English and Japanese) include 20+ topic files, while others may have only a few. Contributions to expand the coverage of any language are welcome. See Contributing.