It would help to get a sense of the RAM / Compute requirements + typical perf in tok/s for those releases in the Model Card, even if it's just benchmarked on one given machine (e.g. 78 tok/s on M4 32GB), to know how better which to pick.
· Sign up or log in to comment