This blog post illustrates, step by step, a Learning to Rank project on a Daily Song Ranking problem using open source libraries.
Introduction: A common problem with machine learning models is their interpretability and explainability. We create a dataset and train a model to achieve a task, then we would like to understand how the model obtains its results. This is often quite difficult, especially with very complex models. In this blog post, I would…
This blog post aims to give a better understanding of DocValues and stored fields in Apache Solr for the operations in which they can be used interchangeably.
In this post, we describe an approach to solving the problem of an application that requires both Full and Atomic Updates, using one of the most powerful concepts in Object-Oriented Programming: Polymorphism.
This is the last post of the Entity Search with graph embeddings series. In Part 2 and Part 3 we illustrated the core of the dissertation, describing in detail the implementation of our solution pipeline. In this final part we will see some evaluation measures and results. We will draw some conclusions explaining which were…
This series of blog posts describes my master's degree dissertation, carried out under the supervision of Prof. Gianmaria Silvello at the University of Padova. The main focus of this project is the use of graph embeddings to create virtual documents for the Information Retrieval Entity Search task. This thesis description is…
This blog post is a quick summary of my (subjective) experience at Haystack 2019: the Search Relevance Conference, hosted in Charlottesville (Virginia, USA) from 24/04/2019 to 25/04/2019. References to the slides will be updated as soon as they become available. First of all, my feedback on the Haystack Conference is extremely positive. From my perspective the conference…
How faceting is calculated in distributed Apache Solr architectures, with an explanation of the internal details and practical examples.
In this post we’ll cover two additional synonym scenarios and we’ll try to summarise all the previous tips in a concise form. Following the approach of the previous posts [1] [2] [3], everything can be applied to both Apache Solr and Elasticsearch. Preconditions: Synonyms and stopwords at query time: this is not just a “theoretical” constraint; imagine if you…
This flash blog post addresses a very specific and common problem: how to manage entities/concepts composed of multiple terms in a vanilla Apache Solr/Elasticsearch instance (no plugins or extensions to install). The (deployment) context: An Elasticsearch or Apache Solr infrastructure where you cannot install third-party components (e.g. plugins, filters, query parsers). This can…