Question: What model precision was used during the benchmarks?

#13
by evewashere - opened

INT4 (post QAT), BF16 (Pre-QAT) or FP8 (Pre-QAT)

Sign up or log in to comment