searxng

Commit Graph

Author	SHA1	Message	Date
Bnyro	2bab658d39	[feat] engine: implementation of seekr for news, images and videos	2023-08-15 16:17:42 +02:00
Markus Heiser	7052a1a7cf	[drop] engine archive.is / blocked by CAPTCHA [1] Sadly archive.is is blocked by a CAPTCHA that can't be avoided (at least in an XPath engine). [1] https://github.com/searxng/searxng/issues/2643 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-14 08:50:13 +02:00
ToxyFlog1627	f175574f37	[fix] typos in documentation & messages	2023-08-13 08:50:29 +02:00
Bnyro	c76c999a1e	[feat] engine: implementation of anaconda	2023-08-10 23:51:07 +02:00
Bnyro	e25d1c7288	[feat] engine: implementation of German news, Tagesschau Co-authored-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-10 20:27:54 +02:00
GeckoLinux	2256ba2ffb	[mod] settings.yml: move wolframalpha to categ general & disable by default As suggested in comment [1] as a good compromise between [2] and [3]. This will help users that want Wolfram Alpha instant answers by default with every query and don't mind the slowness. It will be disabled by default so it's fast for all users, and those that need it can enable it as a user-configurable option without requiring setting up their own instance to change the Wolfram Alpha categorization. [1] https://github.com/searxng/searxng/pull/813#issuecomment-1574851572 [2] https://github.com/searxng/searxng/pull/813#issuecomment-1020581765 [3] https://github.com/searxng/searxng/issues/966#issue-1171107451	2023-08-10 14:31:09 +02:00
Markus Heiser	c381fc001f	[mod] settings: remove lemmy from categ 'general' & enable by default Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-10 12:58:40 +02:00
Bnyro	224f2250ae	[feat] engine: support for lemmy communities, posts, comments and users	2023-08-10 12:58:40 +02:00
allendema_searxng_pi	c00c0c5434	[mod] remove discontinued petalsearch engines	2023-08-09 07:17:40 +02:00
Markus Heiser	460bbe5b81	[mod] implement brave (WEB) engine to replace XPath configuration Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-08 16:21:45 +02:00
Bnyro	d151497db3	[feat] engine: brave - support for news	2023-08-08 16:21:45 +02:00
Bnyro	cae06f2781	[feat] engine: brave - support for videos	2023-08-08 16:21:45 +02:00
Bnyro	73364e158e	[feat] engine: brave - support for images	2023-08-08 16:21:45 +02:00
Markus Heiser	a2108432f7	[fix] settings.yml: typo in engine name wikipecies -> wikispecies Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-07 10:35:44 +02:00
Emilien Devos	0fc8f99ecc	[feat] new engine: bt4g added & enabled, btdigg disabled by default Disable btdigg because on most SearXNG instances, SearXNG is blocked by btdigg due to Cloudflare too many requests. This implementation does not parse the HTML page because there is an API in XML (RSS). The RSS feed provides less data like amount of seeders/leechers and the files in the torrent file. It's a tradeoff for a "stable" engine as the XML from RSS content will change way less than the HTML page. Closes: https://github.com/searxng/searxng/issues/2553	2023-08-06 09:30:48 +02:00
Markus Heiser	dad16c0949	[mod] settings.yml: enable engines from Wikimedia Foundation The Wikimedia wikis [1] engines provide good answers and have short response times --> no reason to disable these engines by default. BTW: this patch adds a (sub-) category ``wikimedia`` for the engines [1]. [1] https://meta.wikimedia.org/wiki/Wikimedia_wikis Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-04 19:06:50 +02:00
Markus Heiser	db522cf76d	[mod] engine: wikimedia - improve results, add additional settings & doc Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-04 19:06:50 +02:00
Bnyro	7d8c20c80d	[feat] new engine: wikispecies	2023-08-04 19:06:50 +02:00
Markus Heiser	1b030d4b41	[doc] engine: Yacy Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-03 19:58:51 +02:00
zutto	ca518c6803	add option to change yacy search mode	2023-08-03 19:58:51 +02:00
Markus Heiser	ef5831cd84	[mod] engine piped: split into two dedicated engines for video & music Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-03 16:23:36 +02:00
Markus Heiser	7aa95d2d52	[doc] engine piped: add documentation to docs/dev/engines/online/ Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-08-03 16:23:36 +02:00
Bnyro	636bfdac68	[feat] engine: implementation of Piped	2023-08-03 16:23:36 +02:00
mrpaulblack	65d8b1a310	[fix] remove disabled: false from engine definitions in settings.yml * setting disabled: false is not needed, since it is by default enabled	2023-07-22 18:19:01 +02:00
Paolo Basso	cada89ee36	[feat] engine: re-enables z-library (zlibrary-global.se) - re-enables z-library as the new domain zlibrary-global.se is now available from the open web. The announcement of the domain: https://www.reddit.com/r/zlibrary/comments/13whe08/mod_note_zlibraryglobalse_domain_is_officially/ It is an official domain, it requires to log in to the "personal" subdomain only to download files, but the search works. - changes the result template of zlibrary to paper.html, filling the appropriate fields - implements language filtering for zlibrary - implement zlibrary custom filters (engine traits) - refactor and document the zlibrary engine	2023-07-07 21:36:51 +02:00
Hackurei	cb92767f19	[feat] engine: add CrowdView forum search engine	2023-07-07 21:36:11 +02:00
Markus Heiser	e2df6b77a3	[mod] engine: Anna's Archive - additional settings (content, sort, ext) Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-06-29 09:32:57 +02:00
Markus Heiser	eafc2906f1	[mod] engine: Anna's Archive - fetch search arguments from search form Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-06-29 09:32:57 +02:00
Paolo Basso	7adb9090e5	[mod] engine: Anna's Archive - add language support	2023-06-29 09:32:57 +02:00
Paolo Basso	e5637fe7b9	[feat] engine: implementation of Anna's Archive Anna's Archive [1] is a free non-profit online shadow library metasearch engine providing access to a variety of book resources (also via IPFS), created by a team of anonymous archivists [2]. [1] https://annas-archive.org/ [2] https://annas-software.org/AnnaArchivist/annas-archive	2023-06-29 09:32:57 +02:00
Paolo Basso	401561cb58	[mod] engine torznab - refactor & option to hide links - torznab engine using types and clearer code - torznab option to hide torrent and magnet links. - document the torznab engine - add myself to authors Closes: https://github.com/searxng/searxng/issues/1124 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-06-28 10:03:44 +02:00
Markus Heiser	f77807257b	[fix] engines: don't spam marginalia.nu with default settings The engine configuration of marginalia [2][3][4][5] spams marginalia.nu with requests from SearXNG instances [1]. It is not in the interest of SearXNG to disturb other FOSS projects, so the engine will be removed:: - name: marginalia engine: json_engine shortcut: mar categories: general paging: false # Key and license: https://www.marginalia.nu/marginalia-search/api/ # index: 0 popular, 1 blogs, 2 big_sites, 3 default, 4 experimental search_url: https://api.marginalia.nu/<insert your key here>/search/{query}?index=4&count=20 results_query: results url_query: url title_query: title content_query: description timeout: 1.5 disabled: true about: website: https://www.marginalia.nu/ official_api_documentation: https://api.marginalia.nu/ use_official_api: true require_api_key: true results: JSON [1] https://github.com/searxng/searxng/issues/1673 [2] https://github.com/searxng/searxng/pull/1627 [3] https://github.com/searxng/searxng/issues/1620 [4] https://news.ycombinator.com/item?id=35874640 [5] `d82a858491/code/services-satellite/api-service/src/main/java/nu/marginalia/api/svc/ResponseCache.java (L12-L20)` Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-06-05 08:23:17 +02:00
Markus Heiser	007a615ffa	[mod] donation_url: disable by default SearXNG's donation campaign has ended. Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-05-15 09:19:17 +02:00
Markus Heiser	45529f51a1	Merge pull request #2347 from return42/mod-lang-detection If language recognition fails use the Accept-Language	2023-04-25 15:46:26 +02:00
Markus Heiser	f1b6351ae1	[fix] engine: google play movies Closes: https://github.com/searxng/searxng/pull/1746 Closes: https://github.com/searxng/searxng/issues/1599 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-04-16 19:15:44 +02:00
Markus Heiser	8adbc4fcec	[mod] settings.yml: enable language detection by default_lang (auto) Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-04-15 22:24:59 +02:00
Markus Heiser	5234e45010	[fix] Gigablast.com has been erased [1] https://www.reddit.com/r/searchengines/comments/128wdcp/gigablastcom_has_been_erased/ Closes: https://github.com/searxng/searxng/issues/2321 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-04-06 08:22:57 +02:00
Markus Heiser	2b8dfab33f	[fix] engine gigablast: add &userid=<User ID>&code=<Feed Code> Gigablast's API blocks unauthorized requests [1]. [1] https://gigablast.com/searchfeed.html Closes: https://github.com/searxng/searxng/issues/1454 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-03-29 16:18:02 +02:00
Markus Heiser	2499899554	[mod] Google: reverse engineered & upgrade to data_type: traits_v1 Partial reverse engineering of the Google engines including an improved language and region handling based on the engine.traits_v1 data. Whenever possible the implementations of the Google engines try to make use of the async REST APIs. The get_lang_info() has been generalized to a get_google_info() function / especially the region handling has been improved by adding the cr parameter. searx/data/engine_traits.json Add data type "traits_v1" generated by the fetch_traits() functions from: - Google (WEB), - Google images, - Google news, - Google scholar and - Google videos and remove data from obsolete data type "supported_languages". A traits.custom type that maps region codes to supported_domains is fetched from https://www.google.com/supported_domains searx/autocomplete.py: Reverse engineered autocomplete from Google WEB. Supports Google's languages and subdomains. The old API suggestqueries.google.com/complete has been replaced by the async REST API: https://{subdomain}/complete/search?{args} searx/engines/google.py Reverse engineering and extensive testing .. - fetch_traits(): Fetch languages & regions from Google properties. - always use the async REST API (formerly known as 'use_mobile_ui') - use supported_domains from traits - improved the result list by fetching './/div[@data-content-feature]' and parsing the type of the various content features --> thumbnails are added searx/engines/google_images.py Reverse engineering and extensive testing .. - fetch_traits(): Fetch languages & regions from Google properties. - use supported_domains from traits - if exists, freshness_date is added to the result - issue 1864: result list has been improved a lot (due to the new cr parameter) searx/engines/google_news.py Reverse engineering and extensive testing .. - fetch_traits(): Fetch languages & regions from Google properties. supported_domains is not needed but a ceid list has been added. - different region handling compared to Google WEB - fixed for various languages & regions (due to the new ceid parameter) / avoid CONSENT page - Google News no longer supports time range - result list has been fixed: XPath of pub_date and pub_origin searx/engines/google_videos.py - fetch_traits(): Fetch languages & regions from Google properties. - use supported_domains from traits - add paging support - implement an async request ('asearch': 'arc' & 'async': 'use_ac:true,_fmt:html') - simplified code (thanks to '_fmt:html' request) - issue 1359: fixed xpath of video length data searx/engines/google_scholar.py - fetch_traits(): Fetch languages & regions from Google properties. - use supported_domains from traits - request(): include patents & citations - response(): fixed CAPTCHA detection (Scholar has its own CAPTCHA manager) - hardening XPath to iterate over results - fixed XPath of pub_type (has been changed from gs_ct1 to gs_cgt2 class) - issue 1769 fixed: new request implementation is no longer incompatible Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-03-24 10:37:42 +01:00
Markus Heiser	a7fe22770a	[mod] Peertube: re-engineered & upgrade to data_type: traits_v1 - fetch_traits(): Fetch languages from peertube's search-index source code. [mod] Include migration of the request method from 'supported_languages' to 'traits' (EngineTraits) object. [fix] old supported_languages_url is no longer valid since the sources have been moved to a different path. - fixed code to pass pylint - request(): complete re-implementation based on the API docs [1] - response(): complete re-implementation, adds several fields missed before - add source code documentation [1] https://docs.joinpeertube.org/api-rest-reference.html#tag/Search/operation/searchVideos Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-03-24 10:37:42 +01:00
Solirs	35fbb3578b	Increase timeout for gentoo wiki engine	2023-02-28 13:54:44 +01:00
Markus Heiser	5820dc78ce	[doc] slight improvements to the doc of the settings (base_url) Closes: https://github.com/searxng/searxng/issues/2190 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-02-17 12:08:58 +01:00
Markus Heiser	52f6bc745b	Merge pull request #2188 from ahmad-alkadri/fix/petalsearch Fix the petalsearch engine	2023-02-15 13:57:28 +01:00
Ahmad Alkadri	f6af59899b	Fix petalsearch and remove petalsearch news	2023-02-14 18:43:55 +01:00
Markus Heiser	7d446dfdb2	[mod] disable engine tineye by default Tineye becomes active as soon as a https:// signature is found in the search term, but most of the time a reverse image search is not requested when a URL is specified, often the URL is just from a C&P. The frequent requests to tineye lead in the end to the SearXNG instance being blocked by tineye and the user seeing unexpected error messages. BTW: many maintainers have disabled this engine in their local SearXNG settings. Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-02-14 08:27:19 +01:00
Markus Heiser	3abff182ea	[fix] remove engine neeva from settings.yml Engine is broken and can't be used any longer as a simple XPath engine. @allendema tested an engines/neeva.py version using json from the dom, but without luck: There was some kind of captcha for pagination. [1] https://github.com/searxng/searxng/issues/2007#issuecomment-1426061698 Closes: https://github.com/searxng/searxng/issues/2007 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-02-10 18:46:37 +01:00
wibyweb	6707354bc8	[mod] engine wiby: add pagination Suggested by: @wibyweb in searx https://github.com/searx/searx/pull/3465 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>	2023-02-10 15:31:24 +01:00
Alexandre Flament	37addec69e	search.suspended_time settings: bug fixes * fix typo in settings.yml: replace suspend_times by suspended_times * always use delay defined in settings.yml: * HTTP status 402 and 403: read the value from settings.yml instead of using the hardcoded value of 1 day. * startpage engine: CAPTCHA suspends the engine for one day instead of one week	2023-01-28 10:24:14 +00:00
Alexandre Flament	13b0c251c4	Merge pull request #2100 from nexryai/master Add goo engine	2023-01-15 23:08:28 +01:00
Léon Tiekötter	0cedb1c6d8	Add search.suspended_times settings Make suspended_time changeable in settings.yml Allow different values to be set for different exceptions. Co-authored-by: Alexandre Flament <alex@al-f.net>	2023-01-15 09:00:32 +00:00

677 Commits