Berkeley Function Calling Leaderboard Updates (v1.3) #1119
ShishirPatil
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Highlights
🏆 Stable release of Berkeley Function Calling Leaderboard V3 with Multi-step and Multi-turn function call evaluation
What's Changed
live_parallel_multiple_9-8-0
copy-paste issue by @pkesseli in [BFCL] Fixlive_parallel_multiple_9-8-0
copy-paste issue #865multi_turn_base_34
Ground Truth by @HuanzhiMao in [BFCL] Fix Typo inmulti_turn_base_34
Ground Truth #876retry_with_backoff
for Amazon Nova Handler by @HuanzhiMao in [BFCL Chore] Implementretry_with_backoff
for Amazon Nova Handler #880live_simple_183-108-0
by @pkesseli in [BFCL] Fixlive_simple_183-108-0
#872live_simple_44-18-0
andlive_simple_45-18-1
by @pkesseli in [BFCL] Fixlive_simple_44-18-0
andlive_simple_45-18-1
#870id
with Result File Test Case IDs by @HuanzhiMao in [BFCL Chore] Align Score Fileid
with Result File Test Case IDs #893o3-mini-2025-01-31
ando3-mini-2025-01-31-FC
by @HuanzhiMao in [BFCL] Add New Modelo3-mini-2025-01-31
ando3-mini-2025-01-31-FC
#898gemini-2.0-flash-001
,gemini-2.0-flash-lite-preview-02-05
,gemini-2.0-pro-exp-02-05
. by @HuanzhiMao in [BFCL] Add New Modelgemini-2.0-flash-001
,gemini-2.0-flash-lite-preview-02-05
,gemini-2.0-pro-exp-02-05
. #902gpt-4.5-preview-2025-02-27
,gpt-4.5-preview-2025-02-27-FC
by @HuanzhiMao in [BFCL] Add New Modelgpt-4.5-preview-2025-02-27
,gpt-4.5-preview-2025-02-27-FC
#922DeepSeek-R1
by @HuanzhiMao in [BFCL] Add New ModelDeepSeek-R1
#901requirements.txt
Location to Remove Global Dependency Confusion by @HuanzhiMao in Fix Gorilla Paperrequirements.txt
Location to Remove Global Dependency Confusion #937deepseek-ai/DeepSeek-R1
by @HuanzhiMao in [BFCL] Support Local Inference fordeepseek-ai/DeepSeek-R1
#926Qwen2.5
Models in Function Calling Mode by @HuanzhiMao in [BFCL] Add Support forQwen2.5
Models in Function Calling Mode #925claude-3-7-sonnet-20250219
,claude-3-7-sonnet-20250219-FC
by @HuanzhiMao in [BFCL] Add New Modelclaude-3-7-sonnet-20250219
,claude-3-7-sonnet-20250219-FC
#923constant.py
Files to aconstants
Folder by @catherineruoxiwu in [BFCL] Reorganized Allconstant.py
Files to aconstants
Folder #944gemini-2.0-flash-lite-001
,gemini-2.0-flash-thinking-exp-01-21
by @HuanzhiMao in [BFCL] Add New Modelsgemini-2.0-flash-lite-001
,gemini-2.0-flash-thinking-exp-01-21
#942Gemma-3
Series Models by @HuanzhiMao in [BFCL] Add GoogleGemma-3
Series Models #939model_metadata.py
toconstants
folder by @catherineruoxiwu in [BFCL] Movemodel_metadata.py
toconstants
folder #949./data/possible_answer
Folder by @catherineruoxiwu in [BFCL] Moved Ground Truths for Executable Tests to./data/possible_answer
Folder #953./bfcl/eval_checker/executable_eval/data/
by @catherineruoxiwu in [BFCL] Reorganizing Codes in./bfcl/eval_checker/executable_eval/data/
#954gemini-2.5-pro
to the Leaderboard by @catherineruoxiwu in [BFCL] Addgemini-2.5-pro
to the Leaderboard #974multi_turn_base_166
Ground Truth. by @HuanzhiMao in [BFCL] Fix Typo inmulti_turn_base_166
Ground Truth. #979Llama-4-Scout
,Llama-4-Maverick
by @HuanzhiMao in [BFCL] Add New ModelsLlama-4-Scout
,Llama-4-Maverick
#981--local-model-path
by @catherineruoxiwu in [BFCL] Add Support for Fully Offline Model Inference via--local-model-path
#985xLAM-2-8b-fc-r
by @HuanzhiMao in Fix Typo in Model Name forxLAM-2-8b-fc-r
#992microsoft/phi-4
to the Leaderboard by @catherineruoxiwu in [BFCL] Addmicrosoft/phi-4
to the Leaderboard #1000writer-sdk
Dependency Version by @HuanzhiMao in Bumpwriter-sdk
Dependency Version #1006live_multiple_1052-279-0
by @itea1001 in [BFCL] fix entry id typo inlive_multiple_1052-279-0
#1022version
tobfcl
CLI by @ShishirPatil in [BFCL] Addversion
tobfcl
CLI #1038Qwen3
Series by @HuanzhiMao in [BFCL] Support DashScope API Inference forQwen3
Series #1061system
role withdeveloper
role for OpenAI models by @errorfourten in [BFCL] Replacesystem
role withdeveloper
role for OpenAI models #1090New Contributors
live_parallel_multiple_9-8-0
copy-paste issue #865constant.py
Files to aconstants
Folder #944Full Changelog: v1.2...v1.3
This discussion was created from the release Berkeley Function Calling Leaderboard Updates (v1.2).
Beta Was this translation helpful? Give feedback.
All reactions