Fix llm_filter to support prompts without context_columns #220
base: main
Conversation
Pull request overview
This PR adds comprehensive audio transcription support to the Flock extension. The main purpose is to enable LLM functions (like llm_complete, llm_filter, llm_rerank, etc.) to process audio files by transcribing them first, then using the transcriptions as context for LLM operations.
Key Changes:
- Implemented audio transcription functionality via OpenAI's Whisper API
- Added URL/file handling utilities for downloading and processing audio files
- Extended all LLM functions to support audio context columns with automatic transcription
- Added metrics tracking system for monitoring LLM function performance
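For orientation, here is a hedged sketch of the usage the overview describes: an audio column passed as a context column so it is transcribed before the completion runs. The table and column names, the 'type': 'audio' marker, and the two-struct argument layout are illustrative assumptions, not taken from this PR's diff; consult the Flock documentation for the authoritative syntax.

```sql
-- Hypothetical sketch: using an audio column as LLM context.
-- Column names, the 'type': 'audio' marker, and the argument layout are assumptions.
SELECT llm_complete(
    {'model_name': 'gpt-4o-mini'},
    {
        'prompt': 'Summarize this customer call in one sentence.',
        'context_columns': [{'data': recording_path, 'type': 'audio'}]
    }
) AS summary
FROM support_calls;
```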
Reviewed changes
Copilot reviewed 78 out of 81 changed files in this pull request and generated no comments.
| File | Description |
|---|---|
| src/model_manager/providers/adapters/openai.cpp | Added transcription request handling for the OpenAI provider |
| src/model_manager/providers/adapters/ollama.cpp | Added error handling for unsupported transcription in Ollama |
| src/model_manager/providers/adapters/azure.cpp | Added transcription support for the Azure provider |
| src/prompt_manager/prompt_manager.cpp | Added audio transcription processing in prompt rendering |
| src/include/flock/model_manager/providers/handlers/url_handler.hpp | New utility for handling file downloads and base64 conversion |
| src/metrics/*.cpp | New metrics tracking system implementation |
| test/unit/prompt_manager/prompt_manager_test.cpp | Added tests for audio transcription functionality |
| test/unit/model_manager/model_providers_test.cpp | Updated model name and added transcription tests |
| test/integration/src/integration/conftest.py | Added audio file path helper and secrets setup |
| test/integration/src/integration/tests/functions/**/*.py | Added audio transcription integration tests across all functions |
Comments suppressed due to low confidence (3)
- test/unit/model_manager/model_providers_test.cpp:1 - The model version 'gemma3:4b' does not exist. The Gemma 3 family was released after your knowledge cutoff (January 2025), but typical Gemma model naming uses '2b', '7b', '9b', '27b' parameter counts. The '4b' variant is not a standard Gemma model size. Consider using a valid Gemma model like 'gemma2:2b' or 'gemma2:9b'.
- test/unit/model_manager/model_manager_test.cpp:1 - The model version 'gemma3:4b' does not exist. Please use a valid Gemma model version such as 'gemma2:2b' or 'gemma2:9b'.
- test/integration/src/integration/tests/functions/scalar/test_llm_embedding.py:1 - The model 'gpt-4o-mini-transcribe' referenced in other test files does not exist. OpenAI's transcription models use 'whisper-1' as the model identifier. Please ensure transcription model references use the correct model name.
Fixes an issue where `llm_filter` only worked when `context_columns` was provided. Now it supports simple prompts without context columns.

Before (only worked with `context_columns`):
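A minimal sketch of the kind of call that worked before the fix, with a context column attached. The argument layout ({'model_name': ...} followed by a struct with 'prompt' and 'context_columns') is assumed from the option names mentioned in this PR and may not match the extension's exact signature.

```sql
-- Hypothetical sketch: llm_filter with a context column attached.
-- The exact argument structure is assumed for illustration.
SELECT *
FROM reviews
WHERE llm_filter(
    {'model_name': 'gpt-4o-mini'},
    {
        'prompt': 'Is this review about a software product?',
        'context_columns': [{'data': review_text}]
    }
);
```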
After (now works without `context_columns`):
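A sketch of the prompt-only form that the fix enables, under the same assumed argument layout:

```sql
-- Hypothetical sketch: a plain yes/no prompt with no context_columns.
-- Calls like this previously failed; with the fix they return a single boolean.
SELECT llm_filter(
    {'model_name': 'gpt-4o-mini'},
    {'prompt': 'Is Paris the capital of France?'}
) AS answer;
```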
The function now handles both cases:
- Without `context_columns`: Makes a single completion request and returns the boolean result
- With `context_columns`: Continues to work as before with batch processing