Apache Lucene Apache Solr Elasticsearch Information Retrieval Main Blog Meetup

London Information Retrieval Meetup October

After the very warm reception of the first and second editions, the third London Information Retrieval Meetup is approaching (21/10/2019) and we are excited to add more details about our speakers and talks! The event is free and you are invited to register: https://www.eventbrite.com/e/london-information-retrieval-meetup-october-tickets-74403100677 Our second speaker is Andrea Gazzarini, our founder and software engineer:…

Apache Lucene Apache Solr Block Join Main Blog nested documents SolrJ

Apache Solr ChildDocTransformerFactory: How to Build Complex ChildFilter Queries

When using nested documents and the Apache Solr Block Join functionality, it is a common requirement to query for an entity (for example, the parent entity) and then retrieve, for each search result, all (or some) of the related children. Let’s see the most important aspects of this functionality and how to apply complex queries when…
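As a rough illustration of the use case described above, the sketch below builds the request parameters for a query that returns each matching parent together with a filtered subset of its children, using the `[child]` document transformer in the `fl` parameter. The field names (`doc_type`, `colour`) and the helper function are hypothetical and only serve the example; the filters are kept free of spaces so no quoting is needed.

```python
from urllib.parse import urlencode


def build_child_transformer_params(parent_query, parent_filter,
                                   child_filter=None, limit=None):
    """Build Solr request parameters that attach children to each parent hit.

    Hypothetical helper: only assembles the parameters, it does not call Solr.
    """
    parts = [f"parentFilter={parent_filter}"]
    if child_filter is not None:
        # Restricts which children are returned for each parent.
        parts.append(f"childFilter={child_filter}")
    if limit is not None:
        # Caps the number of children returned per parent.
        parts.append(f"limit={limit}")
    transformer = "[child " + " ".join(parts) + "]"
    return {"q": parent_query, "fl": f"*,{transformer}"}


# Assumed schema: parents are marked with doc_type:parent,
# children carry a 'colour' field.
params = build_child_transformer_params(
    parent_query="doc_type:parent",
    parent_filter="doc_type:parent",
    child_filter="colour:red",
    limit=5,
)
query_string = urlencode(params)  # ready to append to a /select request
```

Keeping the transformer assembly in one place makes it easier to swap in a more complex child filter later without touching the rest of the request.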

Apache Lucene Apache Solr Elasticsearch Information Retrieval Main Blog Meetup

London Information Retrieval Meetup June

After the very warm reception of the first edition, the second London Information Retrieval Meetup is approaching (25/06/2019) and we are excited to add more details about our speakers and talks! The event is free and you are invited to register: https://www.eventbrite.com/e/london-information-retrieval-meetup-june-tickets-62261343354 Our first speaker is René Kriegler, freelance search consultant and search engineer: René Kriegler René…

Apache Lucene Apache Solr Elasticsearch Enterprise Search Learning To Rank Main Blog Search

Haystack 2019 Experience

This blog post is a quick summary of my (subjective) experience at Haystack 2019: the Search Relevance Conference, hosted in Charlottesville (Virginia, USA) from 24/04/2019 to 25/04/2019. References to the slides will be updated as soon as they become available. First of all, my feedback on the Haystack Conference is extremely positive. From my perspective the conference…

Apache Lucene Apache Solr Learning To Rank Main Blog

London Information Retrieval Meetup

The London Information Retrieval Meetup is approaching (19/02/2019) and we are excited to add more details about the speakers and talks! The event is free and you are invited to register: https://www.eventbrite.com/e/information-retrieval-meetup-tickets-54542417840 After Sambhav Kothari, software engineer at Bloomberg, and Elia Porciani, R&D software engineer at Sease, our last speaker is Andrea Gazzarini, founder and software engineer at…

Analysis Apache Solr Distributed Search facet faceting Information Retrieval Search SolrCloud Tips And Tricks Uncategorized

Apache Solr Distributed Facets

The Apache Solr distributed faceting feature was introduced back in 2008 with the first versions of Solr (1.3 according to this Jira [1]). Until now, I always assumed it just worked, without diving too much into the details. Nowadays distributed search and faceting are extremely popular: you can find them pretty much everywhere (in the…

Analysis Apache Lucene Apache Solr Elasticsearch Information Retrieval Invisible Queries Search SearchHandler Solr schema Synonyms Tips And Tricks

Synonyms and Stopwords: Vademecum

In this post we’ll cover two additional synonym scenarios and we’ll try to summarise all previous tips in a concise form. Following the approach of the previous posts [1] [2] [3], everything can be applied both to Apache Solr and Elasticsearch. Preconditions. Synonyms and stopwords at query time: this is not just a “theoretical” constraint;…

Analysis Apache Lucene Apache Solr Elasticsearch Information Retrieval Search Solr schema Synonyms Tips And Tricks

Still Synonyms + Stopwords?? Mamma mia!

The Context. A brief recap of where we arrived in the preceding article: we had the following synonyms and stopwords settings: synonyms = {“out of warranty”, “oow”}, stopwords = {“of”}. Both of those filters were configured exclusively at query time: the synonym filter first and then the stopwords filter. Using the built-in StopFilter we had a synonym detection…
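To make the setup above more concrete, here is a toy model of a query-time analysis chain with the same settings: the synonym set {“out of warranty”, “oow”} and the stopword “of”. It is only a sketch on flat token lists and it does not reproduce how the real Lucene filters handle positions or token graphs. All function names are hypothetical.

```python
# Toy settings taken from the recap: one multi-word synonym and one stopword.
SYNONYMS = {("out", "of", "warranty"): "oow"}
STOPWORDS = {"of"}


def synonym_filter(tokens):
    """Return the tokens plus the synonym of every matching multi-word phrase."""
    out = list(tokens)
    for phrase, alias in SYNONYMS.items():
        n = len(phrase)
        for i in range(len(tokens) - n + 1):
            if tuple(tokens[i:i + n]) == phrase:
                out.append(alias)
    return out


def stop_filter(tokens):
    """Drop every token that is a stopword."""
    return [t for t in tokens if t not in STOPWORDS]


def analyze(text, chain):
    """Whitespace-tokenize, lowercase, then apply the filters in order."""
    tokens = text.lower().split()
    for token_filter in chain:
        tokens = token_filter(tokens)
    return tokens


# Same order as in the recap: synonym filter first, stopwords filter second.
synonyms_first = analyze("out of warranty", [synonym_filter, stop_filter])
# Reversed order, for comparison only.
stopwords_first = analyze("out of warranty", [stop_filter, synonym_filter])
```

In this simplified model the order of the two filters decides whether the multi-word synonym can still be detected, because removing “of” first breaks the phrase the synonym filter is looking for.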