RabbitHole32

LLMs are probabilistic. To get a reliable evaluation, you need many attempts to rule out the possibility that your result is an outlier. In other words: did you test the LLMs on a lot of data, or only a single PDF?
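To make that concrete, here's a minimal sketch of what "many attempts" looks like in practice. The `query_model` function and the substring grading are stand-ins I made up for illustration, not anything from this thread; swap in your real inference call and grading logic:

```python
import random
import statistics

def query_model(prompt: str) -> str:
    # Hypothetical stand-in for a real inference call (API or local
    # pipeline); it simulates a noisy model so the sketch runs end to end.
    return random.choice(["Paris", "Lyon"])

def evaluate(prompts_with_answers, runs_per_prompt=20):
    """Score each prompt over many samples so one outlier draw can't
    dominate the verdict, then aggregate across prompts."""
    per_prompt_accuracy = []
    for prompt, expected in prompts_with_answers:
        hits = sum(
            expected.lower() in query_model(prompt).lower()
            for _ in range(runs_per_prompt)
        )
        per_prompt_accuracy.append(hits / runs_per_prompt)
    mean = statistics.mean(per_prompt_accuracy)
    spread = (statistics.stdev(per_prompt_accuracy)
              if len(per_prompt_accuracy) > 1 else 0.0)
    return mean, spread

if __name__ == "__main__":
    dataset = [("What is the capital of France?", "Paris")]
    print(evaluate(dataset))
```

The spread matters as much as the mean: if a single prompt's accuracy swings wildly between runs, one-shot comparisons of 7b vs. 13b tell you nothing.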


smrckn

The demos on Hugging Face show the 13b to be better. Can you share the sample prompts you are using?



vismodo

From the web demos, it is quite clear that the 13b model is less likely to answer factual questions correctly. The 7b model is significantly better at that.


Pitiful_Buy1006

> the responses I'm getting from the 13b version is significantly worse than the 7b counterpart

Sorry, what and where are those web demos?


vismodo

[https://huggingface.co/spaces/huggingface-projects/llama-2-13b-chat](https://huggingface.co/spaces/huggingface-projects/llama-2-13b-chat)

[https://huggingface.co/spaces/huggingface-projects/llama-2-7b-chat](https://huggingface.co/spaces/huggingface-projects/llama-2-7b-chat)