newsletterlibrary.com

Top : Computers : Software : Information Retrieval :
Fulltext

Websites
Supplier of information retrieval and collaborative software.
site exerpt
Enterprise Content Management Solutions (ECM Open Text Corporation  De nieuwe informatiehuishouding van de Andere Overheid The Trusted Repository Your Enterprise Records Management System November 13 16, 2006Open Text User Conference Enterprise Content Management This trilogy the dictionary of ECM is a must-read for executives interested in compliance, productivity,...
http://www.opentext.com

Software for indexing and searching text documents, using full text and field based search, relevance ranked results, Boolean queries, and heterogeneous databases. Support for document types such as HTML, SGML, mail folders, and USMARC.
http://www.etymon.com/Isearch/

The tools they use at their site for sale. Demo version available for download.
http://software.infoseek.com/products/ultraseek/ultratop.htm

Zebra is a fulltext and free-text indexing and retrieval system that conforms to ANSI standard Z39.50. It is very good for indexing and searching highly structured data such as MARC records, and GILS records. The Zebra server is freely available for noncommercial applications.
http://www.indexdata.dk/zebra

Search engine vendor of BRS/Search, a text based core product, and web enabled products.
http://www.dataware.com

Information and select sections of a book about indexing and compression techniques for documents and images. Also provides information about open source IR system released with the book.
http://www.cs.mu.oz.au/mg/

A Unix based indexing and query system. It is good for indexing relatively small amounts of data. Different types of indexes allow you to trade off search speed for index size. The default search engine used in Harvest.
http://glimpse.cs.arizona.edu/

A project of the IFLA Section on Information Technology
http://www.ifla.org/VII/s21/p1996/fulltext.htm

High speed, fully featured, multilingual fielded fulltext engine. Available for many platforms including Solaris, BSD, Linux and Windows-NT.
http://www.bsn.com/Z39.50

A scholarly paper by Loren G. Terveen, William C. Hill, Brian Amento, David McDonald, and Josh Creter.
http://www.acm.org/sigchi/chi97/proceedings/paper/lgt.htm

Searches all popular file types, with features including hit highlighting, natural language, fuzzy, phonic, boolean, proximity, field, numeric range.
http://www.dtsearch.com/

Provides document scanning, optical character recognition and full-text searching.
http://www.searchexpress.com/

Cheshire II is a "Next-Generation Online Catalog and Full-Text Information Retrieval System." It features advanced IR techniques, including support for Boolean and probabilistic 'best match' ranked searching, SGML/XML as the primary data base format, and a client/server architecture that uses the Z39.50 Information Retrieval Protocol.
http://cheshire.berkeley.edu/

Combine is an open system for harvesting and threshing (indexing) Internet resources.
http://www.lub.lu.se/combine/

SWISH-Enhanced is a fast, powerful, flexible, free, and easy to use system for indexing collections of Web pages or other text files.
site exerpt
Swish-e Home Page  What a bloody fantastic tool. I went from seeing it mentioned in a link on the hypermail website, to downloading, installing, configuring, integrating the PHP, setting up the cron jobs and having it all working within the hour. Now that's...
http://swish-e.org/

Toolkit (SDK) for adding full-text indexing and searching capabilities to applications. Ported to a wide range of platforms and highly scalable. Designed for use in both large and small scale systems. Free evaluation download.
http://www.lextek.com/onix/

End user software for searching your documents and e-mail. AnswerMap finds specific answers to your typed questions in seconds. View short summaries, full passages, or entire files.
http://www.cognitium.com/answermap.html

Non-numerical information storage and retrieval software developed to allow institutions, especially in developing countries, to streamline their information processing activities.
http://portal.unesco.org/ci/ev.p...RL_SECTION=201&reload=1035195531

Jakarta Lucene is a full-featured text search engine written entirely in Java, and it is an open source project available for free download from Apache Jakarta. The current goals of the project are primarily to provide application and also a platform for research.
http://jakarta.apache.org/lucene/

Suite of search software products that finds information in multiple file formats and languages. Features product descriptions, evaluation version download, company profile and contact information.
http://www.isysusa.com