Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Forvo cloud stopped working 10 hours ago #146

Open
voothi opened this issue Mar 26, 2024 · 6 comments
Open

Forvo cloud stopped working 10 hours ago #146

voothi opened this issue Mar 26, 2024 · 6 comments
Labels
bug Something isn't working

Comments

@voothi
Copy link

voothi commented Mar 26, 2024

Describe the bug
Hello!
Forvo cloud stopped working 10 hours ago. I tried version 0.11.1 and 0.12.0.
I also tried through different internet connections.

To Reproduce
Steps to reproduce the behavior:
In GUI VocabSieve 0.11.1 / 0.12.0 under Windows 11.
In mode
General / Target language = German
I connect Forvo to
Configure / Sources / Pronunciation sources / Enabled pronunciation sources / Forvo.
Checking the settings
Lemmatization policy for pronunciation = Try lemma first, otherwise original
I make Lookup words from the captured sentence.
The word is not reproduced.
The forvo/... file below is not displayed. Only local audio files from local audio libraries are displayed.

Expected behavior
Audio file for selected German word:

  • Downloaded from cloud Forvo.
  • Displayed at the bottom of the application's main screen.
  • Plays automatically.

Screenshots
image
image

Logs

Desktop (please complete the following information):

  • OS: Windows 11.
  • Vocabsieve version (if nightly, must be latest): VocabSieve 0.11.1 / 0.12.0.

Additional context
Telegram thread:
Hello! Forvo stopped working 10 hours ago. I tried version 0.11.1 and 0.12.0.

@voothi voothi added the bug Something isn't working label Mar 26, 2024
@1over137
Copy link
Contributor

1over137 commented Mar 26, 2024

I assume they stepped up their anti scraping measures with Cloudflare, as reported elsewhere with similar projects:

Rascalov/Anki-Simple-Forvo-Audio#31

Rascalov/Anki-Simple-Forvo-Audio#29

In that case there is nothing I can do. Stick to downloaded stuff, probably. If this persists I'll remove the implementation. Maybe I'll look into fetching audio from wiktionary. I'm personally unwilling to do this, but if someone is willing to make a fancy proxy service somewhere like lingva, please let me know.

@Rascalov
Copy link

Rascalov commented Mar 26, 2024

I assume they stepped up their anti scraping measures with Cloudflare, as reported elsewhere with similar projects:

It's an oddball. My extension seemed to work again after I visited the site once on my actual browser to do the Cloudflare Captcha.
My guess is that it will whitelist IPs from the check once it's been gone through at least once.
But then again, I have yet to hear from people if that actually worked for them as well.

@otomiruu
Copy link

Я предполагаю, что они усилили свои меры по борьбе с парсингом с помощью Cloudflare, как сообщалось в других источниках о подобных проектах:

Это чудак. Мое расширение, похоже, снова заработало после того, как я однажды посетил сайт в своем браузере, чтобы ввести Cloudflare Captcha. Я предполагаю, что он внесет IP-адреса в белый список из проверки, как только она будет проверена хотя бы один раз. Но опять же, я еще не слышал от людей, сработало ли это и для них.

Сейчас зашел в Anki, и все заработало, на сайте Forvo не заходил, но если появится данная проблема еще раз, сделаю, как вы сказали. Еще раз спасибо за этот прекрасный аддон

@Rascalov
Copy link

Сейчас зашел в Anki, и все заработало, на сайте Forvo не заходил, но если появится данная проблема еще раз, сделаю, как вы сказали. Еще раз спасибо за этот прекрасный аддон

No problem! But I assume you wanted to send this reply in your own created issue 😅

@1over137
Copy link
Contributor

From what I see, they are still under Cloudflare reverse proxy. The DNS points to a cloudflare IP: http://104.20.253.20/. I assume they are testing this. Most likely they'll just block it again permanently, and then there won't be good solutions other than a full browser scraping script.

@Rascalov
Copy link

Rascalov commented Mar 26, 2024

yeah, there are people (freemdict) that were already posting full site scrapes of audios. Can't say how recent these are though It's still being updated.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

4 participants