18k AI Model Beats Top Gamers via Distillation
Li Er's Agentank uses 18k-parameter distillation to rival top Lux AI players, proving small models can compete with LLMs…
2 articles about 'imitation learning'
Li Er's Agentank uses 18k-parameter distillation to rival top Lux AI players, proving small models can compete with LLMs…
Researchers propose RINSE, a method that automatically evaluates the quality of demonstration data in imitation learning…