Event, News

Sease at 31st Lucene/Solr Meetup Japan

31st Lucene/Solr Meetup Japan

The Lucene/Solr Meetup is a community event in Tokyo, Japan, where professionals and enthusiasts passionate about Information Retrieval convene to discuss the latest trends and hot topics in the realm of Information Retrieval.

Location: Toranomon Hills Business Tower CIC TOKYO 16F, 1-17-1 Toranomon, Minato-ku, Tokyo
Date: 16th October 2024

// our talk

Hybrid Search With Apache Solr Reciprocal Rank Fusion

16th october, tokyo

Vector-based search gained incredible popularity in the last few years: Large Language Models fine-tuned for sentence similarity proved to be quite effective in encoding text to vectors and representing some of the semantics of sentences in a numerical form. These vectors can be used to run a K-nearest neighbour search and look for documents/paragraphs close to the query in a n-dimensional vector space, effectively mimicking a similarity search in the semantic space (Apache Solr KNN Query Parser). Although exciting, vector-based search nowadays still presents some limitations: – it’s very difficult to explain (e.g. why is document A returned and why at position K?) – it doesn’t care about exact keyword matching (and users still rely on keyword searches a lot) Hybrid search comes to the rescue, combining lexical (traditional keyword-based) search with neural (vector-based) search. So, what does it mean to combine these two worlds? It starts with the retrieval of two sets of candidates:

– one set of results coming from lexical matches with the query keywords

– a set of results coming from the K-Nearest Neighbours search with the query vector

The result sets are merged and a single ranked list of documents is returned to the user. Reciprocal Rank Fusion (RRF) is one of the most popular algorithms for such a task. This talk introduces the foundation algorithms involved with RRF and walks you through the work done to implement them in Apache Solr, with a focus on the difficulties of the process, the distributed support(SolrCloud), the main components affected and the limitations faced. The audience is expected to learn more about this interesting approach, the challenges in it and how the contribution process works for an Open Source search project as complex as Apache Solr.

// our speaker

Alessandro Benedetti

DIRECTOR @ SEASE

APACHE LUCENE/SOLR COMMITTER

APACHE SOLR PMC MEMBER

Other posts you may find useful

opensearch neural search plugin tutorial

We are Sease, an Information Retrieval Company based in London, focused on providing R&D project guidance and implementation, Search consulting services, Training, and Search solutions using open source software like Apache Lucene/Solr, Elasticsearch, OpenSearch and Vespa.

Sign up for our Newsletter

Did you like this post? Don’t forget to subscribe to our Newsletter to stay always updated in the Information Retrieval world!

About the company

about our work

Rated Ranking Evaluator
(RRE)

Rated Ranking Evaluator Enterprise (RREE)

Apache Solr LLM Highlighter plugin

News

Main Blog

TIPS AND TRICKS

LATEST BLOG POST

contact us

Don't miss all the news - subscribe to our newsletter!

Sease at 31st Lucene/Solr Meetup Japan

31st Lucene/Solr Meetup Japan

// our talk

Hybrid Search With Apache Solr Reciprocal Rank Fusion

16th october, tokyo

// our speaker

Alessandro Benedetti

Other posts you may find useful

OpenSearch Neural Search Plugin Tutorial

Apache Solr autoGeneratePhraseQueries and Schema

Music Information Retrieval: the Intervals Matrix

Lisa Biella

Lisa Biella

Follow Us

Top Categories

Recent Posts

Hybrid Search Using a Custom Algorithm in Apache Solr

Hybrid Search with Reciprocal Rank Fusion in Apache Solr

Apache Solr Multivalued Vectors Tutorial

Monthly video

Sign up for our Newsletter

Leave a Reply Cancel reply

Quick Links

Services

Subscribe

About the company

about our work

Rated Ranking Evaluator (RRE)

Rated Ranking Evaluator Enterprise (RREE)

Apache Solr LLM Highlighter plugin

News

Main Blog

TIPS AND TRICKS

LATEST BLOG POST

contact us

Don't miss all the news - subscribe to our newsletter!

Sease at 31st Lucene/Solr Meetup Japan

31st Lucene/Solr Meetup Japan

// our talk

Hybrid Search With Apache Solr Reciprocal Rank Fusion

16th october, tokyo

// our speaker

Alessandro Benedetti

Other posts you may find useful

OpenSearch Neural Search Plugin Tutorial

Apache Solr autoGeneratePhraseQueries and Schema

Music Information Retrieval: the Intervals Matrix

Lisa Biella

Lisa Biella

Follow Us

Top Categories

Recent Posts

Hybrid Search Using a Custom Algorithm in Apache Solr

Hybrid Search with Reciprocal Rank Fusion in Apache Solr

Apache Solr Multivalued Vectors Tutorial

Monthly video

Sign up for our Newsletter

Leave a Reply Cancel reply

Rated Ranking Evaluator
(RRE)