London Information Retrieval Meetup February

After the very warm reception of the first year, the fourth London Information Retrieval Meetup is approaching (11/02/2020), and we are excited to share more details about our speakers and talks! The event is free and you are invited to register: https://www.eventbrite.com/e/london-information-retrieval-meetup-february-2020-tickets-89056738101 Our first speaker is Anna Ruggero, one of our R&D software engineers: Anna …

Entity Search with graph embeddings – Part 1 – Overview

This series of blog posts describes my master's degree dissertation, carried out under the supervision of Prof. Gianmaria Silvello at the University of Padova. The main focus of the project is the use of graph embeddings to create virtual documents for the Information Retrieval Entity Search task. This thesis description is …

London Information Retrieval Meetup October

After the very warm reception of the first and second editions, the third London Information Retrieval Meetup is approaching (21/10/2019), and we are excited to share more details about our speakers and talks! The event is free and you are invited to register: https://www.eventbrite.com/e/london-information-retrieval-meetup-october-tickets-74403100677 Our second speaker is Andrea Gazzarini, our founder and software engineer: …

London Information Retrieval Meetup June

After the very warm reception of the first edition, the second London Information Retrieval Meetup is approaching (25/06/2019), and we are excited to share more details about our speakers and talks! The event is free and you are invited to register: https://www.eventbrite.com/e/london-information-retrieval-meetup-june-tickets-62261343354 Our first speaker is René Kriegler, freelance search consultant and search engineer: René has …

Apache Solr Distributed Facets

The Apache Solr distributed faceting feature was introduced back in 2008 with the first versions of Solr (1.3, according to this Jira issue [1]). Until now, I always assumed it just worked, without diving too much into the details. Nowadays distributed search and faceting are extremely popular; you can find them pretty much everywhere (in the …

Synonyms and Stopwords: Vademecum

In this post we'll cover two additional synonym scenarios and we'll try to summarise all the previous tips in a concise form. Following the approach of the previous posts [1] [2] [3], everything can be applied to both Apache Solr and Elasticsearch. Preconditions. Synonyms and stopwords at query time: this is not just a "theoretical" constraint; …

Still Synonyms + Stopwords?? Mamma mia!

The Context. A brief recap of where we arrived in the preceding article: we had the following synonym and stopword settings: synonyms = {"out of warranty","oow"}, stopwords = {"of"}. Both filters were configured exclusively at query time, with the synonym filter first and then the stopword filter. Using the built-in StopFilter we had a synonym detection …

Synonyms + Stopwords?? OMG!

The Context. The scenario is quite simple: we want to use synonyms and stopwords. Following the path of our previous article, we will introduce an additional component in the analysis chain: a StopFilter, which, as the name suggests, removes a set of words from an incoming token stream. We will use the following data …
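
To make the idea of a stop filter concrete, here is a minimal toy sketch in Python. It is not the Lucene StopFilter used by Solr and Elasticsearch: the function name `stop_filter`, the sample tokens and the stopword set are assumptions chosen purely for illustration, and the sketch ignores details a real analysis chain tracks, such as token positions.

```python
from typing import Iterable, Iterator, Set


def stop_filter(tokens: Iterable[str], stopwords: Set[str]) -> Iterator[str]:
    """Toy stop filter: yield every token that is not a stopword.

    Hypothetical helper for illustration only; it compares lowercased
    tokens against the stopword set and keeps no position information.
    """
    for token in tokens:
        if token.lower() not in stopwords:
            yield token


# Illustrative input: a whitespace-tokenized query and a one-word stopword set.
tokens = "out of warranty".split()
filtered = list(stop_filter(tokens, {"of"}))
print(filtered)  # ['out', 'warranty']
```

Because the filter works token by token, any component placed after it in the chain only sees the surviving tokens, which is why the relative order of the filters matters.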