Commit

Update old tests to new results
Various kinds of weird behavior that the tests documented now just work
naturally, so that's good.
polm committed Dec 20, 2024
1 parent c368f5c commit 0550ebd
Showing 1 changed file with 4 additions and 4 deletions.
8 changes: 4 additions & 4 deletions cutlet/test/test_basic.py
@@ -46,7 +46,7 @@
"Kouso to wa, seitai de okoru kagaku hannou ni taishite shokubai to shite kinou suru bunshi de aru.",
),
("ホッピーは元祖ビアテイスト清涼飲料水です", "Hoppy wa ganso beer taste seiryou inryousui desu"),
-("東京タワーの高さは333mです", "Tokyo tower no takasa wa 333 m desu"),
+("東京タワーの高さは333mです", "Tokyo tower no takasa wa 333m desu"),
(
"国立国語研究所(NINJAL)は,日本語学・言語学・日本語教育研究を中心とした研究機関です。",
"Kokuritsu kokugo kenkyuusho (NINJAL) wa, Nippon gogaku/gengogaku/Nippon go kyouiku kenkyuu wo chuushin to shita kenkyuu kikan desu.",
@@ -57,7 +57,7 @@
("《月》", "(gatsu)"), # issue #7, unfamiliar punctuation
("2 【電子版特典付】", "2 [denshi ban tokutentsuke]"), # issue #7
# This looks weird but MeCab tokenizes at alpha-num barriers
-("cutlet23", "Cutlet 23"),
+("cutlet23", "Cutlet23"),
# Test some kana unks - issue #8
("アマガミ Sincerely Your S シンシアリーユアーズ", "Amagami Sincerely Your S shinshiariiyuaazu"),
("ケメコデラックス", "Kemekoderakkusu"),
@@ -91,8 +91,8 @@
("Text McDonald's text", "Text McDonald's text"),
# Following are quote weirdness. Not good but hard to fix.
# An issue is that ," or .' is a single token.
-("It's 'delicious.'", "It's ' delicious .'"),
-('"Hello," he said.', '" Hello ," he said.'),
+("It's 'delicious.'", "It's 'delicious.'"),
+('"Hello," he said.', '"Hello," he said.'),
# this is a very strange typo
("アトランテッィク", "Atoranteku"),
# odoriji. Note at this point these rarely work properly, they mainly
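The old expectations for "333m" and "cutlet23" follow from the comment in the diff that MeCab tokenizes at alpha-num barriers. The sketch below illustrates that effect only. It is not MeCab and not cutlet's implementation: the regex tokenizer and the helper names `split_at_barriers`, `join_with_spaces`, and `join_as_written` are hypothetical stand-ins, and how the library actually arrives at the new output is an assumption the diff does not confirm.

```python
import re

# Hypothetical stand-in for a tokenizer that breaks wherever letters meet
# digits. This is NOT MeCab; it only mimics the alpha-num barrier behavior.
_BARRIER = re.compile(r"[A-Za-z]+|[0-9]+|[^A-Za-z0-9]+")


def split_at_barriers(text):
    """Split text into runs of letters, runs of digits, and everything else."""
    return _BARRIER.findall(text)


def join_with_spaces(tokens):
    """Join tokens with a space between each pair, as the old expectations imply."""
    return " ".join(tokens)


def join_as_written(tokens):
    """Join tokens with no added spaces, matching the updated expectations."""
    return "".join(tokens)


tokens = split_at_barriers("cutlet23")
print(tokens)                    # ['cutlet', '23']
print(join_with_spaces(tokens))  # cutlet 23
print(join_as_written(tokens))   # cutlet23
```

Under these assumptions, the same split applied to "333m" gives "333 m" when spaces are inserted between tokens and "333m" when they are not, which mirrors the first changed test pair.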
