Discussion about this post

User's avatar
Jeremy Greer's avatar

The ability to match or exceed GPT-4 performance on a 7B cannot be overstated - something that also tends to help is a fine-tuned models ability to consistently produce outputs in the correct format. System instructions only do so much...

Expand full comment
Nitin Surya's avatar

Are there any studies available in the impact of lora weights on inferencing performance, e.g. ttft or tpot?

Expand full comment

No posts

Ready for more?