If you have read Part 1 of this blog post, you already know how much can be done with online testing, and in particular the advantages that interleaving brings over A/B testing. But you are still waiting for the answer to one question: how do you implement it? Let’s see together…
You have just trained a learning to rank model and now want to know how it performs. You can start by looking at the evaluation metrics that training reports on the test set, but you are still not sure what the impact of using it on a real website will be. This…
This blog post aims to give a better understanding of DocValues and stored fields in Apache Solr for the operations in which they can be used interchangeably.
This blog post is about our latest contribution to the Apache Lucene/Solr project: introducing the ability to assign different weights to synonyms. This contribution aims to help users who deal with complex synonym dictionaries, where it’s important to associate a numerical weight with each synonym, for example to boost the ones that are more important in…
In this post we describe what an Intervals Table is and how to build it using a Behaviour-Driven Development (BDD) approach.
In this post we describe an approach to solving the problem of an application that requires both Full and Atomic Updates, using one of the powerful concepts in Object-Oriented Programming: Polymorphism.
It was the spring of 2018, and Andrea was strenuously working on a customer project, continuously tuning search configurations and manually checking the ground truth for certain queries. That was pretty much the standard at the time. The brilliant Quepid from our friends at OpenSource Connections helped in some use cases, but there was…
This is the last post of the Entity Search with graph embeddings series. In Part 2 and Part 3 we illustrated the core of the dissertation, describing in detail the implementation of our solution pipeline. In this final part we will see some evaluation measures and results. We will draw some conclusions explaining which were…
Third part of the journey into Entity Search through embeddings. The focus of the post is the ranking phase.
Let’s continue our journey into this entity search thesis! In Part 1 we described what entities and entity search are. We explained how this search is implemented in the state of the art. We also introduced the new approach of this dissertation, specifying the dataset and the test collection used. Finally we described…