
Conversation


@ofermend ofermend commented Nov 8, 2025

Summary

  • Added configurable question type distribution control for query generation
  • Users can now customize the percentage of each question type (directly answerable, reasoning required, unanswerable, partially answerable)
  • Weights are automatically normalized and validated
  • Setting a weight to 0 disables that question type entirely
  • Updated documentation and configuration examples

Changes

Core Implementation (open_rag_eval/query_generation/llm_generator.py:27-376)

  • Added question_type_weights parameter to LLMQueryGenerator.__init__() with configurable distribution
  • Implemented weight validation to ensure non-negative values and at least one active type
  • Implemented weight normalization to convert raw weights to percentages (sum to 100%)
  • Added dynamic prompt building based on enabled question types and their percentages
  • Improved questions-per-doc calculation with 1.5x buffer for deduplication/filtering
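The validation and normalization behavior described above can be sketched roughly as follows. This is an illustrative sketch, not the PR's actual implementation; the four question-type keys are taken from the PR description, and the helper name `normalize_weights` is hypothetical.

```python
# Illustrative sketch only -- not the code merged in this PR.
VALID_TYPES = (
    "directly_answerable",
    "reasoning_required",
    "unanswerable",
    "partially_answerable",
)


def normalize_weights(weights: dict) -> dict:
    """Validate raw weights and convert them to percentages summing to 100."""
    unknown = set(weights) - set(VALID_TYPES)
    if unknown:
        raise ValueError(f"Invalid question type keys: {sorted(unknown)}")
    if any(v < 0 for v in weights.values()):
        raise ValueError("Question type weights must be non-negative")
    # Missing types default to 0, i.e. they are disabled.
    full = {key: weights.get(key, 0) for key in VALID_TYPES}
    total = sum(full.values())
    if total == 0:
        raise ValueError("At least one question type weight must be greater than 0")
    return {key: (value / total) * 100 for key, value in full.items()}
```

For example, passing `{"directly_answerable": 1, "reasoning_required": 1}` would yield 50% for each of those two types and 0% for the rest.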

Configuration & Documentation

  • Updated README.md with comprehensive examples and explanation of the feature
  • Added question_types section to all config examples (CSV, local, Vectara)
  • Documented how to disable question types by setting weight to 0
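A config fragment along these lines illustrates the feature. The section name `question_types` comes from the PR description; the specific type weights shown here are hypothetical values, and the exact layout may differ from the merged config examples.

```yaml
# Hypothetical example; see the merged config_examples/ files for the real layout.
question_types:
  directly_answerable: 40
  reasoning_required: 30
  unanswerable: 20
  partially_answerable: 10
  # Weights are auto-normalized, so they need not sum to 100.
  # Setting a weight to 0 disables that question type entirely.
```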

Integration (open_rag_eval/run_query_generation.py:201-237)

  • Integrated question type weights from config into generator initialization
  • Added OmegaConf to dict conversion for proper weight handling
  • Simplified document loading parameter passing

Testing (tests/query_generation/test_llm_generator.py)

  • Added 10 comprehensive test cases covering:
    • Default weight behavior (equal distribution)
    • Custom weight normalization
    • Auto-normalization of arbitrary weights
    • Disabling question types with zero weights
    • Error handling for invalid weights (all zeros, negative values, invalid keys)
    • Prompt generation with enabled/disabled types
    • Partial weight specification

@ofermend ofermend requested a review from Copilot November 8, 2025 23:55

Copilot AI left a comment


Pull Request Overview

This PR adds configurable question type weighting to the LLM query generator, allowing users to control the distribution of different question types (directly answerable, reasoning required, unanswerable, and partially answerable) in the generated queries.

Key changes:

  • Added question_type_weights parameter to LLMQueryGenerator with weight validation and normalization
  • Replaced hardcoded question type prompts with dynamically generated instructions based on configured weights
  • Updated configuration examples and documentation to demonstrate the new feature

Reviewed Changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 1 comment.

Summary per file:

| File | Description |
| --- | --- |
| open_rag_eval/query_generation/llm_generator.py | Implements question type weighting with validation, normalization, and dynamic prompt generation |
| open_rag_eval/run_query_generation.py | Integrates question type weights from config, converts OmegaConf to dict, and refactors to inline parameter passing |
| tests/query_generation/test_llm_generator.py | Adds comprehensive test coverage for weight validation, normalization, partial specification, and prompt generation |
| config_examples/query_generation_vectara.yaml | Adds example configuration for question type weights |
| config_examples/query_generation_local.yaml | Adds example configuration with an additional commented example showing how to disable types |
| config_examples/query_generation_csv.yaml | Adds example configuration for question type weights |
| README.md | Documents the new question type customization feature with examples |
Comments suppressed due to low confidence (1)

open_rag_eval/run_query_generation.py:1

  • The magic number 1.5 is unexplained. Consider extracting it as a named constant (e.g., DEDUPLICATION_BUFFER_MULTIPLIER = 1.5) to improve code clarity and make it easier to adjust if needed.

```python
"""Main orchestration script for query generation."""
```


@ofermend ofermend requested review from Copilot and vish119 November 11, 2025 02:22
Copilot finished reviewing on behalf of ofermend November 11, 2025 02:24

Copilot AI left a comment


Pull Request Overview

Copilot reviewed 8 out of 8 changed files in this pull request and generated 3 comments.



```python
total = sum(full_weights.values())
if total == 0:
    raise ValueError("Cannot normalize weights: all weights are zero")
return {key: (value / total) * 100 for key, value in full_weights.items()}
```

Copilot AI Nov 11, 2025


Missing blank line between method definitions. According to PEP 8, there should be two blank lines between method definitions in a class.

Suggested change:

```diff
 return {key: (value / total) * 100 for key, value in full_weights.items()}
+
```
Comment on lines +103 to +108:

```python
# Check that at least one weight is positive
if all(v == 0 for v in weights.values()):
    raise ValueError(
        "At least one question type weight must be greater than 0"
    )
```

Copilot AI Nov 11, 2025


Inconsistent validation behavior for partial weight specifications. The _validate_weights method at line 104 checks if all values in the provided weights dict are zero, but _normalize_weights fills missing keys with 0. This means if a user provides {'directly_answerable': 0} (only one key), it will pass validation (since it's checking the provided dict), but when normalized, all four types will have 0 weight, causing the normalization to fail with "Cannot normalize weights: all weights are zero" at line 124. The validation should check the filled weights dict to provide a consistent error message.

Suggested change:

```diff
-# Check that at least one weight is positive
-if all(v == 0 for v in weights.values()):
-    raise ValueError(
-        "At least one question type weight must be greater than 0"
-    )
+# Check that at least one weight is positive (after filling missing keys with 0)
+full_weights = {key: weights.get(key, 0) for key in valid_keys}
+if all(v == 0 for v in full_weights.values()):
+    raise ValueError(
+        "At least one question type weight must be greater than 0"
+    )
```

Comment on lines +365 to +366:

```python
self.assertEqual(percentages.get('unanswerable', 0), 0)
self.assertEqual(percentages.get('partially_answerable', 0), 0)
```

Copilot AI Nov 11, 2025


[nitpick] The .get() method with default value is unnecessary here. The _normalize_weights method always returns all four question type keys (see lines 119-121 in llm_generator.py where it fills in missing keys). These assertions can be simplified to self.assertEqual(percentages['unanswerable'], 0) and self.assertEqual(percentages['partially_answerable'], 0).

Suggested change:

```diff
-self.assertEqual(percentages.get('unanswerable', 0), 0)
-self.assertEqual(percentages.get('partially_answerable', 0), 0)
+self.assertEqual(percentages['unanswerable'], 0)
+self.assertEqual(percentages['partially_answerable'], 0)
```

