
Query Discovery in RRE Enterprise

The role of the "Intruder" layers

One of the things that makes RRE, the open source version, fast and immediate to use is its direct communication with the target search engine.

That means a search engineer, within the IDE, can use RRE to bind a set of ratings and test them directly against a Solr or an Elasticsearch instance.

While this is undoubtedly pragmatic for concretely improving the search quality of the system under test, it introduces a strong compromise: the queries defined in the system, which are executed to compute the search quality metrics, are not “user queries”. They need to be declared using the native search engine language. Here’s an example of an Elasticsearch query:

				
{
  "query": {
    "match": {
      "name": {
        "query": "$query",
        "minimum_should_match": "3<-75% 9<-85%"
      }
    }
  }
}
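
To make the role of the $query placeholder more concrete, here is a minimal Python sketch of how a template like the one above could be bound to a user query before being sent to the engine. It is not RRE's actual implementation: the bind_template helper and the sample search terms are hypothetical and only illustrate the idea.

```python
import json

# The query template from the example above; "$query" is the placeholder.
TEMPLATE = """
{
  "query": {
    "match": {
      "name": {
        "query": "$query",
        "minimum_should_match": "3<-75% 9<-85%"
      }
    }
  }
}
"""


def bind_template(node, placeholders):
    """Return a copy of `node` where every string value that is exactly a
    placeholder (e.g. "$query") is replaced by its bound value."""
    if isinstance(node, dict):
        return {key: bind_template(value, placeholders) for key, value in node.items()}
    if isinstance(node, list):
        return [bind_template(item, placeholders) for item in node]
    if isinstance(node, str):
        return placeholders.get(node, node)
    return node


# Hypothetical search terms, used only for illustration.
engine_query = bind_template(json.loads(TEMPLATE), {"$query": "fender jazz bass"})
print(json.dumps(engine_query, indent=2))
```

Working on the parsed JSON rather than on the raw string means the bound value can never break the JSON syntax, whatever characters the search terms contain.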

Again, that establishes a powerful and direct connection with the target search engine but, in our experience, it doesn’t reflect the widely used and proven three-tier architecture that distributes system responsibilities among:

a client application: typically a frontend layer (e.g., AngularJS or ReactJS)
an API layer: an intermediate component (actually a set of components in the case of a microservices architecture) that is in charge of hiding, abstracting, and implementing the system logic by coordinating and orchestrating the internal subsystems (e.g., an RDBMS, a search engine, a NoSQL storage)
a “datasource” layer, which consists of one or multiple storage subsystems. Each of them manages data, even the same data in some cases, to serve different purposes.

Martin Fowler, in his famous book “Patterns of Enterprise Application Architecture” describes that layered architecture as composed of the Presentation, the Domain, and the Data Source layers.

Back to our search quality context: in a typical architecture, an “intruder” (the API layer) sits between the user query and the corresponding search engine query.

The API layer implements the system logic. That means that, starting from a request (let’s simplify and call it a Search API request), there’s a business workflow that triggers several actions involving several components.

For the search engine, that means the API layer builds and executes a search-engine-specific query, for example taking into account ACLs, permission filters, boosting logic and so on.
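
As an illustration, the following Python sketch shows the kind of translation an API layer might perform. Everything in it is an assumption made for the example: the shape of the Search API request, the field names (allowed_groups, in_stock) and the boost value are hypothetical and are not taken from RRE or from any real system.

```python
def build_engine_query(api_request):
    """Translate a (hypothetical) Search API request into an Elasticsearch
    query, adding the logic the end user never sees."""
    return {
        "query": {
            "bool": {
                # What the user actually typed.
                "must": [{"match": {"name": {"query": api_request["q"]}}}],
                # ACL / permission filter added by the API layer.
                "filter": [{"terms": {"allowed_groups": api_request["user_groups"]}}],
                # Business boosting logic added by the API layer.
                "should": [{"term": {"in_stock": {"value": True, "boost": 2.0}}}],
            }
        }
    }


api_request = {"q": "jazz bass", "user_groups": ["staff", "editors"]}
engine_query = build_engine_query(api_request)
```

The user only typed two words, yet the query that reaches the search engine also carries a filter and a boost. Evaluating the bare match query would therefore measure something different from what production users experience.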

If we want to actually measure the search quality of a system like that, is it correct to discard the role the API layer plays? Definitely not. And unfortunately, that is exactly what RRE, the open source version, does.

Ideally, I would like to be able to consider the whole system, including the API layer, as something to measure.

RRE Enterprise: the Query Discovery

RRE Enterprise fills the gap described above by implementing a query discovery mechanism. How does it work? Without going into technical details, the underlying idea is pretty simple:

Do not consider the presentation layer in the evaluation process
Split the Query entity into two related requests: the Search API request and the Search Engine request
Trigger a Search API request towards the Search API Layer
Capture the corresponding Search Engine request (on the Search Engine side)
Store the correlation between them

In the end, a rating definition will therefore include all the relevant pieces that contributed to a given query execution, including the Search API request and the corresponding Search Engine requests.
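
The steps above can be sketched in a few lines of Python. This is a simplified simulation under stated assumptions, not RREE's implementation: FakeSearchEngine, FakeSearchApi and discover are hypothetical names, and the capture happens in memory rather than on a real Solr or Elasticsearch instance.

```python
from dataclasses import dataclass, field


@dataclass
class FakeSearchEngine:
    """Stand-in for Solr/Elasticsearch that records every request it receives."""
    captured: list = field(default_factory=list)

    def search(self, engine_request):
        self.captured.append(engine_request)
        return {"hits": []}


@dataclass
class FakeSearchApi:
    """Stand-in for the API layer: turns an API request into an engine request."""
    engine: FakeSearchEngine

    def handle(self, api_request):
        engine_request = {
            "query": {
                "bool": {
                    "must": [{"match": {"name": api_request["q"]}}],
                    "filter": [{"terms": {"allowed_groups": api_request["user_groups"]}}],
                }
            }
        }
        return self.engine.search(engine_request)


def discover(api, engine, api_request):
    """Trigger one Search API request, capture the engine requests issued
    while it is being served, and return the correlation between them."""
    already_seen = len(engine.captured)
    api.handle(api_request)
    return {
        "search_api_request": api_request,
        "search_engine_requests": engine.captured[already_seen:],
    }


engine = FakeSearchEngine()
api = FakeSearchApi(engine)
correlation = discover(api, engine, {"q": "jazz bass", "user_groups": ["staff"]})
```

The returned dictionary pairs the Search API request with the Search Engine requests it produced, which is the information a rating definition needs in order to describe the full query execution.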

RREE implements the query discovery described above for both Apache Solr and Elasticsearch.

The only assumption required for a successful discovery is to have exclusive access to the target search engine during that process.

As you can imagine, if some other process is using the search engine at the same time, it would be very hard in the correlation phase to distinguish the queries executed as a consequence of the RREE discovery from those issued by other applications.
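
A tiny Python sketch makes the problem visible. It assumes, purely for illustration, that correlation attributes every engine request captured during the discovery window to the API request that was triggered; the correlate function and the "source" labels are hypothetical, and a real engine-side capture would not know where a query came from.

```python
def correlate(count_before, captured):
    """Attribute every engine request captured since `count_before`
    to the Search API request triggered by the discovery."""
    return captured[count_before:]


captured = []  # engine-side log of incoming requests

# Exclusive access: only the discovery talks to the search engine.
count_before = len(captured)
captured.append({"source": "search-api", "q": "jazz bass"})
exclusive = correlate(count_before, captured)

# Shared access: another application queries the engine in the same window.
count_before = len(captured)
captured.append({"source": "search-api", "q": "jazz bass"})
captured.append({"source": "other-application", "q": "*:*"})
shared = correlate(count_before, captured)

print(len(exclusive), len(shared))
```

In the shared case the foreign query ends up correlated with the Search API request, and nothing on the engine side allows it to be told apart from a legitimate one.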

Recap

RREE Query Discovery is a crucial component of the evaluation infrastructure: it makes it possible to consider the Search application as a whole, so that the system under evaluation is as close as possible to the real production environment.

It does so by including in the evaluation process the business and system logic carried out by intermediate API layers, which are a crucial part of the Search application.

The purpose is to maximize the trustworthiness of the evaluation process output.

Need Help With This Topic?

If you’re struggling with query discovery, don’t worry – we’re here to help! Our team offers expert services and training to help you optimize your search engine and get the most out of your system. Contact us today to learn more!

Andrea Gazzarini
