kwindla hultman kramer

This is true for function calling (tool use), too

November 12, 2024

Function names influence tool call behavior quite a lot. https://t.co/vanyOWZz73
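A rough sketch of what "function names" means here: the snippet below builds two tool definitions that are identical except for the function name. The dict layout follows the OpenAI-style tools format. The tool names (lookup, get_current_weather), the description, and the city parameter are hypothetical, chosen only for illustration; none of them come from the post.

```python
import json


def make_tool(name: str) -> dict:
    """Build an OpenAI-style tool definition; only the function name varies.

    The description and the `city` parameter are hypothetical placeholders.
    """
    return {
        "type": "function",
        "function": {
            "name": name,
            "description": "Return the current weather for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }


# Same description and parameters, different names.
vague_tool = make_tool("lookup")
descriptive_tool = make_tool("get_current_weather")

print(json.dumps(descriptive_tool, indent=2))
```

Holding everything but the name fixed is what makes a comparison like this meaningful: any difference in how often the model calls the tool can then be attributed to the name itself.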

jason (@jxnlco)

Changing a single field name in our LLM response schema improved accuracy from 4.5% to 95% on GSM8K.

The fix was simple: going from final_choice to final_answer. Turns out our model was returning a multiple-choice index instead of the actual answer.
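The quoted post does not show the schema itself, so the following is only a minimal sketch of the kind of change described. It assumes a JSON Schema response format with a free-text reasoning field plus one answer field. The chain_of_thought field and the integer type are assumptions; only the names final_choice and final_answer come from the post.

```python
import json


def build_response_schema(answer_field: str) -> dict:
    """Return a JSON Schema for a structured LLM response.

    `chain_of_thought` and the integer answer type are assumptions for
    illustration; only the answer field's name is varied.
    """
    return {
        "type": "object",
        "properties": {
            "chain_of_thought": {"type": "string"},
            answer_field: {"type": "integer"},
        },
        "required": ["chain_of_thought", answer_field],
    }


before = build_response_schema("final_choice")  # reads like "pick an option"
after = build_response_schema("final_answer")   # reads like "give the result"

# The two schemas differ only in the name of one field.
changed = set(before["properties"]) ^ set(after["properties"])
print(sorted(changed))  # prints ['final_answer', 'final_choice']
print(json.dumps(after, indent=2))
```

Both schemas accept exactly the same values, so schema validation alone cannot catch this kind of mistake: the field name is the only signal telling the model whether to return an option index or the computed answer.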

If you're working with