This series of blog posts wants to describe my master degree dissertation done with the supervision of Prof. Gianmaria Silvello at the University of Padova. The main focus of this project is in the use of graph embeddings in order to create virtual documents for the Information Retrieval Entity Search task. This thesis description is…
When using nested documents and the Apache Solr Block Join functionality it is a common requirement to query for an entity (for example the parent entity) and then retrieve for each search result all(or some of) the related children. Let’s see the most important aspects of such functionality and how to apply complex queries when…
A Software Engineer is always required to give his customers a concrete evidence about deliverables quality. A Search Engineer deals with a specialisation of such generic Software Quality, which is called Search Quality. What is Search Quality? And why is it so important in a search infrastructure? After all, the “Software Quality” should be omni-comprensive,…
// our service Shameless plug for our training and services! Did I mention we do Apache Solr Beginner and Elasticsearch Beginner training?We also provide consulting on these topics, get in touch if you want to bring your search engine to the next level! // STAY ALWAYS UP TO DATE Subscribe to our newsletter Did you like this post about the…
Scenario You’re working as a search engineer for XYZ Ltd, a company which sells electric components. XYZ provided you the application logs of the last six months, and some business requirements. Two kinds of customers, two kinds of requirements, two kinds of search The log analysis shows that XYZ has mainly two kinds of customers:…
Quantity detection? What is a quantity? And why do we need to detect it? A quantity, as described by Martin Fowler in his “Analysis Patterns” [1] is defined as a pair which combines an amount and unit (such as 30 litres, 0.25 cl, or 140 cm). In search-based applications, there are many cases where you may want to…
What are “Invisible Queries”? This is an extract of an article [1] on Lucidworks.com, by Grant Ingersoll, talking about invisible queries: “It is often necessary in many applications to execute more than one query for any given user query. For instance, in applications that require very high precision (only good results, forgoing marginal results), the…
Last Stage Of The Journey This blog post is about the Apache Solr Learning To Rank ( LTR ) integration. We modelled our dataset, we collected the data and refined it in Part 1 .Trained the model in Part 2 .Analysed and evaluate the model and training set in Part 3 .We are ready to…
This blog post is about the Apache Solr Learning to Rank Tools : a set of utilities for the Solr LTR integration.
Model Training For Apache Solr Learning To Rank If you want to train a model for Apache Solr Learning To Rank, you are in the right place.This blog post is about the model training phase for the Apache Solr Learning To Rank integration. We modelled our dataset, collected the data and refined it in Part…