You have just trained a learning to rank model and you now want to know how it performs. You can start by looking at the evaluation parameters returned by the train on the test set, but you are still not sure of which will be the impact in using it in a real website. This … Continue reading The Importance of Online Testing in Learning to Rank – Part 1
This blog post aims to give a better understanding of Docvalues and stored fields in Apache Solr for the operations in which they can be used interchangeably.
I've always loved R&D and I've always been fascinated by seeing in action the implementation of my ideas. It was in July 2010 when I started my professional journey in the Open Source search landscape, I was a Junior software engineer at the time, and after few months of post graduate research at Roma3 University … Continue reading Apache Lucene/Solr Committer !
This blog post is about our latest contribution to the Apache Lucene/Solr project:introducing the ability of assigning different weights to synonyms.This contribution aims to help users that deal with complex synonyms dictionaries where it's important to associate a numerical weight to each of them, for example to boost the ones that are more important in … Continue reading Introducing Weighted Synonyms in Apache Lucene/Solr
In this post we describe what is an Intervals Table and how to build it using a Behaviour-Driven-Development (BDD) approach.
In this post we describe an approach to solve the problem of an application that requires both Full and Atomic Updates, using one of the powerful concepts in Object Oriented Programming: Polymorphism.
After the very warm reception of the first year, the fourth London Information Retrieval Meetup is approaching (11/02/2020) and we are excited to add more details about our speakers and talks!The event is free and you are invited to register : https://www.eventbrite.com/e/london-information-retrieval-meetup-february-2020-tickets-89056738101 Our first speaker is Anna Ruggero, one of our R&D software engineers: Anna … Continue reading London Information Retrieval Meetup February
It was the spring of 2018, Andrea was strenuously working on a customer project, continuously tuning search configurations and checking the ground truth for certain queries manually. That was pretty much the standard at the time, the brilliant Quepid from our friends at Open Source Connection helped in some use cases, but there was nothing … Continue reading Road to Rated Ranking Evaluator Enterprise
This is the last post of the Entity Search with graph embeddings serie. In Part 2 and Part 3 we illustrated the core of the dissertation describing in detail the implementation of our solution pipeline. In this final part we will see some evaluation measures and results. We will draw some conclusions explaining which were … Continue reading Entity Search with graph embeddings – Part 4 – Evaluation and conclusion
In this post we want to get to the heart of the process of virtual documents creation. As we explained in Part 1, these documents are essential for the retrieval phase and for its performances. This part of the pipeline is, indeed, the one where we create our own approaches in many different versions. Summarizing … Continue reading Entity Search with graph embeddings – Part 3 – Documents and retrieval