Yes, the model is exactly the one you mentioned, but it is the quantized version of it. The context length is indeed 128k for this family of models.

If I misunderstood your question, please elaborate.

Written by Karan Kaul | カラン

Writes about Machine Learning/Data Science. Say Hi on LinkedIn - https://www.linkedin.com/in/krnk97/
