Skip to content

Latest commit

 

History

History
49 lines (29 loc) · 1.62 KB

TDD-C-202208-CC-002.md

File metadata and controls

49 lines (29 loc) · 1.62 KB

TrBanking77

This is a Turkish intent detection dataset, created by translating Banking77 using Google Translate. The queries are in banking domain. Each example has a single intent label. Some of the sentences are duplicated because some of the similar sentences are translated to same exact form in Turkish.

Samples

{"id": 0, "label": "card_arrival", "text": "Hala kartımı bekliyorum?"}
{"id": 1, "label": "card_arrival", "text": "Kartım 2 hafta geçmesine rağmen hala gelmediyse ne yapabilirim?"}
{"id": 153, "label": "card_linking", "text": "Kartım bulundu. Uygulamaya geri koymamın bir yolu var mı?"}
{"id": 154, "label": "card_linking", "text": "Kartımı bulabildim. Uygulamama nasıl koyabilirim?"}

Fields

field description type
id example id int
label label of the example string
text text of the example string

Splits

The number of examples per train and test splits are as follows:

Train Set Test Set
Number of examples 10003 3080

Dataset Creation

Curation Rationale

The dataset is motivated by the desire to advance intent detection and short text classification in Turkish language.

Additional Information

Dataset Curators

Kaan Alkan, Mustafa Erden, Levent M. Arslan

License

This dataset is licensed under Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International license.