Freeware Listing: Nutch
- Arch Search Engine
- License: Freeware

Arch is an open source extension of Apache Nutch (a popular, highly scalable general-purpose search engine) for intranet search. Not happy with your corporate search engine? That is not surprising; very few people are. To the best of our knowledge, no intranet engine works as well as Google's global Web search does. There is a fundamental reason for this: the algorithms that Google and similar engines use on the global Web do not work nearly as well on intranets, because intranets lack the statistical data those algorithms rely on. Arch (finally!) solves this problem. It uses a novel method that delivers high-precision search results.
- Publisher: CSIRO Astronomy and Space Science
- Date: 18-08-2016
- Size: 23338 KB
- Platform: Win2000, WinXP, Win7 x32, Win7 x64, Windows 8, Windows 10, WinServer, WinOther, WinVista, WinVista x64
- trouvA
- License: Freeware

An open source search engine based on Sphinx and Nutch.
The aim of this project is to provide a free and open alternative to Google.
Particular attention will be given to French.
The project will also help students and engineers understand how a search engine works.
Feel free to contact us.
trouvA License - Creative Commons Attribution Non-Commercial License V2.0.
- Publisher: Trouva
- Date:
- Platform: WinOther
- Able2Know Search Toolbar
- License: Freeware

A volunteer-maintained reference resource with an aggressive pop-up blocker. In addition to blocking pop-ups, this search bar offers easy access to the internet's best resources and network tools without collecting an iota of information or bundling any adware, spyware or malware. The Able2Know.com Toolbar is an advanced researcher's tool that lets you search through dozens of search engines and online references. What sets the Able2Know.com Toolbar apart from others is that it does not restrict searches to one search engine but searches all the best web search engines, meta-search engines, directories, and references.
- Publisher: Able2Know.com - Ask an Expert
- Date: 07-12-2004
- Size: 238 KB
- Platform: Win2000, Windows CE, WinOther
- Searchblox
- License: Freeware

SearchBlox is completely browser-based. It supports both HTTP and filesystem-based crawling of documents and can index and search Word, Excel, PowerPoint, PDF, text, and HTML documents in 17 languages.
- Publisher: Robert Selvaraj
- Date: 20-11-2011
- Platform: JavaScript, Scripts
- ARADO
- License: Freeware

ARADO is an open source bookmark database for web search. You can easily save and organize your favourite URLs (bookmarks). Arado Websearch is a complete bookmark management solution that lets you synchronize, organize, and manage your favourite internet pages, remove duplicates, and check whether their content has changed. The database can be networked with your other devices, such as a laptop, mobile phone, or home or work PC, so that all added URLs are synchronized across your connected devices. Arado also lets you search the web from any of your networked devices. Arado is not a company service and does not track its users.
- Publisher: Arado
- Date: 17-11-2012
- Size: 12595 KB
- Platform: WinOther
- JavaWAC
- License: Freeware

Web-as-corpus tools in Java:
* Simple crawler (also integrates with Nutch and Heritrix)
* HTML cleaner to remove boilerplate code
* Language recognition
* Corpus builder
JavaWAC License - Apache License V2.0.
- Publisher: Javawac
- Date:
- Platform: WinOther
- Carrot2
- License: Freeware

Carrot2 is an Open Source Search Results Clustering Engine. It can automatically organize small collections of documents, e.g. search results, into thematic categories. Carrot2 integrates very well with both Open Source and proprietary search engines.
Features: two high-quality document clustering algorithms; integration with public and open source search engines (Nutch, Solr, Lucene); easy integration with Java and non-Java software; a GUI application for tuning clustering for specific collections; a simple web application; and a native C# / .NET API.
Carrot2 License - BSD License.
- Publisher: Carrot2
- Date:
- Size: 19033 KB
- Platform: Linux, Mac OS X, WinOther
- SearchBlox Server
- License: Freeware

SearchBlox provides high-performance content search solutions designed for site search, vertical search, and embedding in custom applications. It offers the best in search technology with utmost flexibility in deployment and customization.
Compared to other products in the market, SearchBlox offers unparalleled ease of use in deploying, configuring and maintaining the search solution. With SearchBlox, search solutions can be implemented in a matter of minutes, instead of months.
With over 300 customers in 28 countries, SearchBlox delivers cost-effective and innovative search solutions backed by excellent support.
- Publisher: SearchBlox Software, Inc
- Date:
- Size: 43540 KB
- Platform: Win2000, WinOther
- WhelanLabs Search Engine Manager
- License: Shareware

A Windows-based Internet search engine ideally suited to cost-conscious organizations that have private-network content they want local users to be able to search. (For example, employees of a company could use it to search corporate web servers.)
The application is built on industry standard components (Apache Nutch and Apache Tomcat), has a simple installation process, is fully integrated into Windows (start menu access, uninstaller), has decent on-line documentation (along with references to additional component documentation), and an on-line forum for users.
This is a good alternative to a Google server for a cost-conscious organization.
- Publisher: Whelanlabs.com
- Date:
- Size: 67399 KB
- Platform: WinOther, WinVista








