Berkeley Function Calling Leaderboard Updates (v1.2) #869
ShishirPatil announced in Announcements
Highlights
🏆 Berkeley Function Calling Leaderboard V3 with Multi-step and Multi-turn function call evaluation
What's Changed
- [BFCL] Add New Model `o1-preview-2024-09-12` and `o1-mini-2024-09-12` by @HuanzhiMao in #635
- [BFCL] Robustness Patch for `_multi_threaded_inference` by @HuanzhiMao in #754
- [BFCL] Remove `Llama-3.2-3B-Instruct-FC` and `Llama-3.2-1B-Instruct-FC` from Leaderboard by @HuanzhiMao in #749
- [BFCL Chore] Supply `data_multi_turn.csv` for Multi-Turn Evaluation Results by @HuanzhiMao in #762
- [BFCL] Remove Duplicate Line in `record_cost_latency` by @HuanzhiMao in #767
- [BFCL] Add `claude-3-5-haiku-20241022`, `claude-3-5-haiku-20241022-FC`, `claude-3-5-sonnet-20241022`, `claude-3-5-sonnet-20241022-FC` by @HuanzhiMao in #750
- [BFCL] Add New Model `Qwen/Qwen2.5-72B-Instruct` by @HuanzhiMao in #787
- [BFCL Chore] Add `@final` and `@overrides` Decorators to Class Methods in Model Handler by @VishnuSuresh27 in #790
- [BFCL Chore] Quick fix change of decorators from `@overrides` to `@override` by @VishnuSuresh27 in #797
- [BFCL] Add Amazon Models `nova-pro-v1.0`, `nova-lite-v1.0`, and `nova-micro-v1.0` by @HuanzhiMao in #815
- [BFCL Chore] Revamp `README.md` for Clearer Instructions by @HuanzhiMao in #819
- [BFCL] Add New Model `Llama-3.3-70B-Instruct`, `Llama-3.3-70B-Instruct-FC` by @HuanzhiMao in #837
- [BFCL] Add `o1-2024-12-17` and `o1-2024-12-17-FC` by @HuanzhiMao in #840
- [BFCL] Add `Qwen2.5-0.5B-Instruct`, `Qwen2.5-3B-Instruct`, `Qwen2.5-14B-Instruct`, `Qwen2.5-32B-Instruct` by @HuanzhiMao in #842
- [BFCL] Add New Model `watt-tool-8B` and `watt-tool-70B` by @zhanghanduo in #847
- [BFCL] Add `gemini-2.0-flash-exp-FC`, `gemini-2.0-flash-exp`, `gemini-exp-1206-FC`, `gemini-exp-1206` by @HuanzhiMao in #843
- [BFCL] Use `N/A` in Score Report for Unevaluated Categories by @HuanzhiMao in #849
- [BFCL] Add Mistral Local Serving Handler and Add New Model `mistralai/Ministral-8B-Instruct-2410` by @HuanzhiMao in #855
- [BFCL] Add New Model `DeepSeek-V3` by @HuanzhiMao in #857
- [BFCL] Rename Directories: `proprietary_model` -> `api_inference`, `oss_model` -> `local_inference` for Better Clarity by @HuanzhiMao in #859
New Contributors
- @zhanghanduo made their first contribution in #847
Full Changelog: v1.1...v1.2
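For readers unfamiliar with the decorators mentioned in #790 and #797, here is a minimal sketch of what `@final` and `@override` express on handler-style classes. It is an illustration only: the release notes do not say which library the decorators are imported from, so this sketch assumes the standard-library `typing` versions (with a no-op fallback for `override` on Python older than 3.12), and `BaseHandler`, `EchoHandler`, `inference`, `_query`, and `decode` are hypothetical names, not the project's actual handler API.

```python
from typing import final

try:
    from typing import override  # available in Python 3.12+
except ImportError:
    def override(method):
        """Fallback no-op so the sketch also runs on older interpreters."""
        return method


class BaseHandler:
    """Hypothetical base class; not the actual BFCL model handler."""

    @final
    def inference(self, prompt: str) -> str:
        # Fixed pipeline: type checkers flag any subclass that redefines it.
        return self.decode(self._query(prompt))

    def _query(self, prompt: str) -> str:
        # Subclasses are expected to supply the model call.
        raise NotImplementedError

    def decode(self, raw: str) -> str:
        # Default post-processing; subclasses may extend it.
        return raw.strip()


class EchoHandler(BaseHandler):
    """Hypothetical subclass standing in for a concrete model handler."""

    @override
    def _query(self, prompt: str) -> str:
        # Marked as an intentional override of BaseHandler._query.
        return f"  echo({prompt})  "

    @override
    def decode(self, raw: str) -> str:
        # Extends the base behavior rather than replacing it.
        return super().decode(raw).upper()


result = EchoHandler().inference("get_weather")
print(result)
```

Neither standard-library decorator enforces anything at runtime. Their value comes from static type checkers, which report a method marked `@override` that matches nothing in the base class (for example after a rename) and a subclass that redefines a `@final` method.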
This discussion was created from the release Berkeley Function Calling Leaderboard Updates (v1.2).