It's possible; the question is how to choose which submodel handles a given query.
You can use a dedicated router model, or a larger general-purpose LLM, to do this routing.
Also, some work suggests using smaller LLMs to generate multiple candidate responses and a stronger, larger model to rank them (ranking is much cheaper than generating).
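To make the two patterns concrete, here is a minimal sketch. Both the router and the ranker would normally be LLM calls; the keyword lookup and the `score_fn` below are hypothetical stand-ins so the example runs on its own.

```python
def route(query: str, experts: dict[str, list[str]]) -> str:
    """Pick a submodel for the query.

    In practice a small router LLM (or a larger general model) would
    classify the query; a keyword lookup stands in for that call here.
    """
    for name, keywords in experts.items():
        if any(kw in query.lower() for kw in keywords):
            return name
    return "general"  # fallback submodel


def rank_candidates(candidates: list[str], score_fn) -> str:
    """Generate-then-rank: cheap models produce the candidates,
    a stronger model scores them. Scoring one short prompt per
    candidate is far cheaper than generating each response.
    """
    return max(candidates, key=score_fn)


# Hypothetical expert table, for illustration only.
experts = {"code": ["python", "bug", "compile"], "math": ["integral", "prove"]}
print(route("How do I fix this Python bug?", experts))  # -> code

# `len` stands in for the strong model's scoring call.
best = rank_candidates(["draft A", "a longer draft B"], score_fn=len)
print(best)  # -> a longer draft B
```

In a real system, `score_fn` would wrap a single call to the larger model asking it to score or pick among the candidates, which is where the efficiency gain over full generation comes from.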