PolyMeta Search and Clustering Engine
PolyMeta is an advanced Web 2.0 meta-search (federated search) and clustering engine. PolyMeta enables organizations and individuals to simultaneously search diverse information resources on the Web with a common interface. The search results are merged, ranked and presented in relevance order.
PolyMeta employs powerful Natural Language Processing and Information Retrieval algorithms in its query analysis and refinement, search strategy, relevancy ranking, focused drill-down and exploration of multi-dimensional information spaces.
Try out AllPlus Meta Search and Discovery Engine what is based on PolyMeta, to see what we are capable of!
We are open to implement any kind of federated search engine based on our product.
Contact us, if you have any further questions!
- Multi field parsing and clustering (Dublin Core compliant).
- Allows customers to configure access to an arbitrary number of Web accessible information resources which can be grouped into different search categories as desired by the customer.
- Uses PolySpell spell checker what includes millions of words from PolyMeta's proprietary English Spell Checker, Web Spell Checker, Medical Spell Checker and Scientific Spell Checker.
- Powerful and dynamic conceptual and contextual analysis and clustering of retrieved content based on proprietary linguistic (lexical-morphological, syntactic, semantic and pragmatic) and statistical text processing techniques. Try out text clustering.
- Flexible administrative subsystem to support diverse target database configurations, user access and services and interface options
- Sophisticated proprietary merging and ranking algorithms
- Automatic change detection and reconfiguration of search target databases
- Platform independence (Java technology) and support for multiple operating systems (Windows, Linux, Solaris)
- Robust scalable multi server architecture
- Supported protocols: HTTP, Z39.50
- Searching and clustering API is also available
- Information on the Web is inherently heterogeneous: content is distributed on multiple servers in multiple locations and multiple formats and languages aimed for diverse audiences and purposes
- Even the largest of the search engines, Google or Bing indexes only a portion of all Web pages. You can't find everything on every search engines: See an example
- The “Hidden Web” of content databases (e.g. PubMed, Web of Science) is estimated to be thousands of times larger than the Open Web.
- Both the Open Web and the Hidden Web are characterized by problems of information coverage, quality, overload, relevancy, currency and completeness, as well as language ambiguity and incompatible user interfaces
- Meta-Search Engines may simultaneously search multiple Open Web and Hidden Web sites in order to increase content coverage, precision, relevance and/or search efficiency and effectiveness