Online Testing for Learning To Rank: Interleaving

If you have read Part 1 of this blog post, you should know by now how many fantastic things can be done with online testing! In particular, the advantages that interleaving brings with respect to A/B testing, but you are still waiting for the answer to a question: how to implement it? Let's see together … Continue reading Online Testing for Learning To Rank: Interleaving

The Importance of Online Testing in Learning to Rank – Part 1

You have just trained a learning to rank model and you now want to know how it performs. You can start by looking at the evaluation parameters returned by the train on the test set, but you are still not sure of which will be the impact in using it in a real website. This … Continue reading The Importance of Online Testing in Learning to Rank – Part 1

Entity Search with graph embeddings – Part 4 – Evaluation and conclusion

This is the last post of the Entity Search with graph embeddings serie. In Part 2 and Part 3 we illustrated the core of the dissertation describing in detail the implementation of our solution pipeline. In this final part we will see some evaluation measures and results. We will draw some conclusions explaining which were … Continue reading Entity Search with graph embeddings – Part 4 – Evaluation and conclusion

Entity Search with graph embeddings – Part 1 – Overview

This series of blog posts wants to describe my master degree dissertation done with the supervision of Prof. Gianmaria Silvello at the University of Padova. The main focus of this project is in the use of graph embeddings in order to create virtual documents for the Information Retrieval Entity Search task. This thesis description is … Continue reading Entity Search with graph embeddings – Part 1 – Overview

Haystack 2019 Experience

This blog is a quick summary of my (subjective) experience at Haystack 2019 : the Search Relevance Conference, hosted in Charlottesville (Virginia, USA) from 24/04/2019 to 25/04/2019.References to the slides will be updated as soon as they become available. First of all my feedback on the Haystack Conference is extremely positive.From my perspective the conference … Continue reading Haystack 2019 Experience

Apache Solr Distributed Facets

Apache Solr distributed faceting feature has been introduced back in 2008 with the first versions of Solr (1.3 according to this jira[1]) . Until now, I always assumed it just worked, without diving too much into the details. Nowadays distributed search and faceting are extremely popular, you can find them pretty much everywhere (in the … Continue reading Apache Solr Distributed Facets

Synonyms and Stopwords: Vademecum

In this post we'll cover two additional synonyms scenarios and we'll try to summarise all previous tips in a coincise form. Following the approach of the previous posts [1] [2] [3], everything can be applied both to Apache Solr and Elasticsearch. Preconditions Synonyms and stopwords at query time: this is not just a "theoretical" constraint; … Continue reading Synonyms and Stopwords: Vademecum

Still Synonyms + Stopwords?? Mamma mia!

The Context Brief recap of where we arrived in the preceding article: we had the following synonyms and stopwords settings: synonyms = {"out of warranty","oow"} stopwords = {"of"} Both of those filters were configured exclusively at query-time; the synonym filter first and then the stopwords filter. Using the built-in StopFilter we had a synonym detection … Continue reading Still Synonyms + Stopwords?? Mamma mia!