Openindex inside

Built with advanced search engine technology by openindex

Privacy guaranteed

Openindex is all about respecting privacy. All data collected is stored anonymously. No data is sold or provided to third parties.

Extract numerous entities from your text

Automatically finds people, brands, locations, time indications and much more.

Works on any website and text source

It doesn’t matter what technology is behind the websites or data sources, as long as the output is readable by our parser. The input can be any accessible website, but also text files, PDFs, etc.

Support for multiple languages

We currently support forty-eight languages for EntitySearch, but additional language models are being built. It supports about forty different entity types and integrates seamlessly with our parser.

API available

A web API is available through which Entitysearch can be queried. This allows the technology to be integrated into automated processes.

Try the online demo below

Enter a URL or free text in the text box below and see what information is extracted.


{{ content }}

Entitysearch uses the following techniques

Apache OpenNLP

Apache OpenNLP is a machine learning based toolkit for the processing of natural language text.

Apache Solr

Solr is the popular, blazing-fast, open source enterprise search platform built on Apache Lucene™. Openindex has its own highly customized and optimized Solr instance that serves as the base of our platform.


A perceptron is a neural network in which the neurons are connected in different layers. A first layer consists of input neurons, where the input signals are applied.

Maximum Entropy

The principle of maximum entropy states that the probability distribution that best reflects the current state of knowledge is the one with the greatest entropy, in the context of accurately stated previous data.

PoS tagging

In corpus linguistics, part-of-speech tagging, also known as grammatical tagging, is the marking of a word in a text corresponding to a particular word portion, based on both the definition and the context.

More information

If you are interested in our EntitySearch please contact us to discuss the possibilities.

Entitysearch is available in the following languages:

  • Afrikaans
  • العربية Arabic
  • Български, Bulgarian
  • Català, Catalan
  • Česky, Czech
  • Dansk, Danish
  • Deutsch, German
  • Ελληνικά, Greek
  • English
  • Español, Spanish
  • Eesti, Estonian
  • فارسی, Farsi
  • Suomi, Finnish
  • Français, French
  • Frysk, West Frisian
  • Gaeilge, Irish
  • Gàidhlig, Scottish Gaelic
  • Galego, Galician
  • עברית, Hebrew
  • हिन्दी, Hindi
  • Hrvatski, Croatian
  • Magyar, Hungarian
  • Bahasa Indonesia, Indonesian
  • Íslenska, Icelandic
  • Italiano, Italian
  • 日本語, Japanese
  • 한국어, Korean
  • Lietuvių, Lithuanian
  • Latviešu, Latvian
  • मराठी, Marathi
  • Malti, Maltese
  • Nederlands, Dutch
  • Norsk (bokmål), Norwegian
  • Polski, Polish
  • Português, Portuguese
  • Română, Romanian
  • Русский, Russian
  • Slovenčina, Slovak
  • Slovenščina, Slovenian
  • српски, Serbian
  • Svenska, Swedish
  • தமிழ், Tamil
  • ภาษาไทย, Thai
  • Tagalog
  • Türkçe, Turkish
  • Українська, Ukrainian
  • اردو, Urdu
  • Tiếng Việt, Vietnamese
  • 中文, Chinese