SearchBlox is an Enterprise Search Server built on top of Apache Lucene. It includes integrated crawlers for HTTP/HTTPS, filesystems and feeds; a web-based Admin Console to configure and manage up to 250 indexes; a REST API; multilingual support to index content in 37 languages; and packages for deployment to Linux/Unix, Windows and Mac OS X. SearchBlox is suitable for Website Search, Intranet Search, Custom Search and eDiscovery.
SearchBlox Cloud is our cloud-hosted search service. We host, manage and support SearchBlox for you “in the cloud”. There is no software to download or install and no hardware for you to manage.
End-User Features
- Seamlessly search across RSS and Atom Web Feeds, HTTP(S), Filesystem and custom content
- Automatically group search results into clusters (clustered search) for fast access to the right information
- Advanced Search – Search by file format, language, keyword occurrence and modified date
- Spelling Suggestions – Using words from indexed content
- Date Range search – restrict search results to a particular date range
- Automatic highlighting of user search query terms in HTML and PDF documents
- Keyword-in-Context Display – search results are displayed with areas of content where the keyword occurs
- User-defined number of search results per page
- Simple and Advanced Query Syntax
- Supports Boolean AND, OR and NOT searches, as well as fuzzy and fielded searches
- Browsable Categories for quick access to categorized content
- Sort – search results can be sorted by date, relevance or alphabetically
- Hit Highlighting – query terms are highlighted on content title and description
- Collections – users can limit search to specific collections
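To make the Keyword-in-Context Display and Hit Highlighting features above more concrete, here is a minimal Python sketch of the general technique. It is an illustration only and not SearchBlox's implementation: the function name `keyword_in_context`, the `width` parameter and the `<b>` markup are all assumptions made for this example.

```python
import re


def keyword_in_context(text, term, width=30):
    """Return one snippet per occurrence of `term` in `text`.

    Each snippet keeps up to `width` characters on either side of the
    match and wraps the matched term in <b> tags (hypothetical markup).
    """
    pattern = re.compile(re.escape(term), re.IGNORECASE)
    snippets = []
    for match in pattern.finditer(text):
        start = max(0, match.start() - width)
        end = min(len(text), match.end() + width)
        before = text[start:match.start()]
        after = text[match.end():end]
        snippet = f"{before}<b>{match.group(0)}</b>{after}"
        # Mark snippets that were cut out of a longer text.
        if start > 0:
            snippet = "..." + snippet
        if end < len(text):
            snippet = snippet + "..."
        snippets.append(snippet)
    return snippets


if __name__ == "__main__":
    sample = "Enterprise search servers index documents. Search results show context."
    for line in keyword_in_context(sample, "search", width=10):
        print(line)
```

Matching is case-insensitive, and the original casing of the matched text is preserved inside the highlight.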
Administrator Features
- AJAX-based Admin Console – easy to use and intuitive console to manage all aspects of the Search application
- Featured Results – Highlight links in the search results page when the user enters specific search terms
- Web-based editor for easy customization of search results
- Fast deployment of clustered search results using the built-in clustering engine
- Choice of Memory-Based Index (for very fast indexing) or Disk-Based Index (for large document collections)
- Built-in Replication to synchronize search indexes across multiple instances of SearchBlox
- Collections – create up to 250 document collections with customized settings
- Look & Feel – search results are customizable using CSS or XSLT stylesheets, and can also be delivered as XML
- Automatic Generation of Browsable Categories using Category metadata in feeds and documents
- Built-in Crawlers to index HTTP, HTTPS, File System, RSS and Atom Web Feed content
- Built-in file serving of documents in File System Collections without URL mapping
- Support for indexing content through Proxy Servers
- Selective indexing of sections of HTML pages using <noindex> </noindex> or <!--stopindex--> <!--startindex--> tags
- Protected Content – crawlers can index content protected with Basic HTTP and Form-Based Authentication
- Reporting – real-time reporting with weekly, daily and hourly top queries and zero-match queries for up to 3 months
- On-Demand & Scheduled Indexing of content
- Check for duplicate documents during indexing
- Addition and Deletion of individual documents from the index
- Disable stemming for individual indexes
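As an illustration of the selective indexing feature listed above, the following Python sketch shows one way a crawler could drop excluded regions from an HTML page before indexing it. This is a sketch under stated assumptions, not SearchBlox's actual crawler code: the function name `strip_excluded_regions` and the regular expressions are hypothetical, and a production crawler would more likely use a real HTML parser.

```python
import re

# Regions wrapped in <noindex> ... </noindex> are excluded from the index.
NOINDEX_TAGS = re.compile(
    r"<noindex>.*?</noindex>",
    re.IGNORECASE | re.DOTALL,
)

# Regions between the stopindex and startindex comment markers are excluded too.
STOPINDEX_COMMENTS = re.compile(
    r"<!--\s*stopindex\s*-->.*?<!--\s*startindex\s*-->",
    re.IGNORECASE | re.DOTALL,
)


def strip_excluded_regions(html):
    """Return `html` with all excluded regions removed (hypothetical helper)."""
    html = NOINDEX_TAGS.sub("", html)
    return STOPINDEX_COMMENTS.sub("", html)


if __name__ == "__main__":
    page = (
        "<p>keep</p>"
        "<noindex><nav>menu</nav></noindex>"
        "<p>also keep</p>"
        "<!--stopindex-->footer<!--startindex-->"
        "<p>end</p>"
    )
    print(strip_excluded_regions(page))
```

Navigation menus, footers and other repeated page furniture are typical candidates for exclusion, since indexing them tends to add noise to search results.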
Supported Languages
SearchBlox can index content in 37 languages:
- Arabic
- Bengali
- Chinese (Simplified)
- Chinese (Traditional)
- Czech
- Danish
- Dutch
- English
- Estonian
- Finnish
- French
- German
- Greek
- Gujarati
- Hebrew
- Hindi
- Hungarian
- Italian
- Japanese
- Kannada
- Korean
- Latvian
- Lithuanian
- Malayalam
- Norwegian
- Polish
- Portuguese
- Russian
- Romanian
- Slovak
- Slovenian
- Spanish
- Swedish
- Tamil
- Telugu
- Thai
- Turkish
Supported File Formats
- HTML
- Word
- Excel
- PowerPoint
- PDF
- Text
- RTF
- More coming soon…