How much data need for small VLA fitting

0.5B model,

0.6B model,

LLama3.2 1B model

1.5B model

7B model

epochs, acc.

Training step to success rate

LLama3.2 1B model results is following:

libero spatial

action head is mlp2, same with openvla oft

action head is bottleneck 4 blocks, same with vote paper