Suchmaschinen

Categories: search

Suchmaschinen

Die 30 besten Suchmaschinen im Kurztest 2021

Freie Suchmaschinensoftware

CC - Content Search:

Startups Search: Search Less. Close More. Grow your revenue with all-in-one rospecting solutions powered by the leader in private-company data.

Viral content :

Self hosted

Self hosted seach engine:

Crawler Apache Nutch

Elastic Search

Search Elastic Search

Suchoberfläche Calaca:

Ambar

Apache Solr

Apache Solr™ 9.2.1 - Solr is the popular, blazing-fast, open source enterprise search platform built on Apache Lucene™

https://lucene.apache.org/solr/

Apache Nutch

https://www.apache.org/dyn/closer.lua/nutch/1.16/apache-nutch-1.16-bin.zip

Scrapy (Python)

Scrapy is a fast high-level web crawling and web scraping framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing.

Heritrix (Java)

Heritrix is the Internet Archive’s open-source, extensible, web-scale, archival-quality web crawler project.

WebSPHINX (Java)

WebSPHINX ( Website-Specific Processors for HTML INformation eXtraction) is a Java class library and interactive development environment for web crawlers. A web crawler (also called a robot or spider) is a program that browses and processes Web pages automatically.

openseachserver

Links

http://www.intellspot.com/open-source-web-crawlers/

Written on April 22, 2020