-
Notifications
You must be signed in to change notification settings - Fork 94
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Less results than real Google #225
Comments
I'll quote what has been said already on a SearX issue about it:
|
I'm aware words are shared in many languages and that if you search, for example, the word Normally I'd be all for giving more choice to the user (given that the default should ALWAYS be [all]) but history has shown us it's not happening. Some instances block access to settings.php entirely. Some reset back to I'm tired boss. I just want to proxy my Google searches without being crippled. |
I see where you're coming from but that's not the "type of garbage" I meant. The garbage most likely got triggered by this Google message
which occurs more often when you narrow your search to only 1 language afaik. You're getting the results from AAA (aka GARBAGE, wildly unrelated) rather than the query BBB you were searching for your absolute best fix would be to find a public instance hosted in your country, set to find results in all languages, but good luck with it |
if an user wants to willingly exclude the vast majority of results AND trusts Google's language filter accuracy as well that's fine by me, then we go back to the initial, less radical suggestion of
|
I'm sorry for necroposting a bit. But I'm tired of always switching the language in the search setting between the two that give me correct results: first one being English, second one my native Slovak. Is this fixable the same way in LibreX? (e.g. put "all" in the results language?) Or can I do arrays, like [en,sk]? |
But Google is able to. As a result you shouldnt expect to get the exact same results between instances hosted in different countries (and I believe between all instances in general).
It's technically possible to make 2 search queries and combine the results of each. |
Using "all" now. Getting a lot of russian results at the top. Putting a language code in the search itself now, helps much more. |
LibreY has it, but only in the settings. |
(cross-posted in searxng/searxng#2438, benbusby/whoogle-search#1004)
It's been years. Google-searching through a "privacy search engine frontend" will rarely find as many results as the real Google.
Here's a simple test to verify that: come up with a unique Google query that will find as few results as possible, preferably not in English. For example in my test I used
"sfendazi"
but you might need your own unique query since the results come and go. Perform the same search on every public instance, and observe how many find the same results (if the results contain garbage unrelated stuff, consider it a failure). This was the outcome yesterday as of 2023-05-15:LibreX instances: 20 tested, 0 work
lmao
Whoogle instances: 17 tested, 3 work
https://s.tokhmi.xyz
https://whoogle.dcs0.hu
https://whoogle.privacydev.net
SearX/SearXNG instances: 92 tested, 19 work if you tweak a setting, only 1 works with defaults
(the only one that works with defaults is https://opnxng.com)
https://priv.au
https://xo.wtf
https://offtheradar.info
https://searx.oakleycord.dev
https://searx.cthd.icu
https://ooglester.com
https://search.bus-hit.me
https://myprivatesrx.us
https://coppedge.info
https://search.neet.works
https://search.zzls.xyz
https://search.us.projectsegfau.lt
https://s.frlt.one
https://searx.sev.monster
https://stalk.antelope.day
https://searx.esmailelbob.xyz
https://search.serginho.dev
https://search.cronobox.one
https://searx.mxchange.org
Those 19 instances I listed think they're "smart" and have set their
Search language
to[auto]
, which auto-selects it based on your browser headers... or they're simply set to something arbitrary, like[en-US]
. Choosing[all]
fixes the problem for them.Meanwhile, the rest of the instances somehow will not find the correct results even when set to
[all]
. From what I've tested with a local SearXNG instance, adding search query parameternfpr=1
(along with the pre-existingsafe=off
andfilter=0
) tosearxng/searx/engines/google.py
fixed it. Here's what they do:nfpr=1
-> Showing results for XXX Search instead for YYY ONsafe=off
-> SafeSearch OFFfilter=0
-> Include omitted results ONChanging the
Interface language
is fine. Actually, I'd argue language auto-detection should happen to the interface, not to the search results filter, which would be consistent with how major search engines work.Honestly, just take the
![](https://camo.githubusercontent.com/1df1db3c1f0ab91155c1a79fc5b8967761c653029ff4a8f96ffafc13abcfdedb/68747470733a2f2f692e696d6775722e636f6d2f716d724b71774f2e706e67)
Search language
option away, it does more harm than good. Or at least make[all]
the default and lock the option behind huge warning signs with flaming skulls that searching will be seriously degraded for everyone if anything other than[all]
is selected. People don't understand this is the equivalent setting they're touching (taken from Google's official advanced search page):TL;DR
Here's a picture to sum up the problem most search frontends are facing:
![](https://camo.githubusercontent.com/1f6333f7c6a24d35f8404fe27b4a36ca0c34ed89eaa26931db4c2b6bac5b076f/68747470733a2f2f692e696d6775722e636f6d2f463648576b44522e706e67)
Proposed fixes:
Search language
and default it to[all]
[auto]
to theInterface language
instead?safe=off&nfpr=1&filter=0
The text was updated successfully, but these errors were encountered: