This blog post aims to explore Word2Vec, the algorithm we used to generate synonyms in our contribution to Apache Lucene
This blog post aims to explore our contribution to Apache Lucene, which integrates a Word2Vec model to generate synonyms
This blog post aims to give a better understanding of Docvalues and stored fields in Apache Solr for the operations in which they can be used interchangeably.
This flash blog post will address a very specific and common problem : how to manage entities/concepts composed by multiple terms in a vanilla Apache Solr/Elasticsearch instance ( no plugins or extensions to install). The (deployment) context An Elasticsearch or Apache Solr infrastructure where you cannot install third-party components (e.g. plugins, filters, query parsers). This can…
Quantity detection? What is a quantity? And why do we need to detect it? A quantity, as described by Martin Fowler in his “Analysis Patterns” [1] is defined as a pair which combines an amount and unit (such as 30 litres, 0.25 cl, or 140 cm). In search-based applications, there are many cases where you may want to…
Introduction This blog post is about the Lucene Inverted Index and how Apache Solr internally works. When playing with Solr systems, understanding and properly configuring the underline Lucene Index is fundamental to deeply control your search.With a better knowledge of how the index looks like and how each component is used, you can build a…
This blog post is about the Apache Solr Suggester feature. Here you find all the available implementations with examples and tricks and tips.