Viewing a single comment thread. View all comments

Kamimashita t1_j5usxq7 wrote

Do you know if any of those other models have APIs for queries/inference or finetuning? I've looked into GPT alternatives and most seem closed off.

2

MysteryInc152 t1_j5uvo3i wrote

Nothing that would beat Open AI's stuff (Google's stuff) is open for inference or finetuning from the public.

I think the best Open source alternative is this

https://github.com/THUDM/GLM-130B

https://huggingface.co/spaces/THUDM/GLM-130B

But it's not finetuned for instruction so you have to prompt/approach it like a text completer. And also you'll need a 4x3090 to get it running locally.

The best open source instruction finetuned models are the flan t5 models

https://huggingface.co/google/flan-t5-xxl

If you're not necessarily looking for open source but still actual alternatives that aren't just an API wraparound of GPT, you can try cohere

https://cohere.ai/pricing

Good thing is that it's completely free for non commercial or non production use

or alephalpha

https://app.aleph-alpha.com/

Not free but the pricing is decent and they have a visual language model as well. Something like flamingo

https://www.deepmind.com/blog/tackling-multiple-tasks-with-a-single-visual-language-model

6

currentscurrents t1_j5uvgk8 wrote

Not at this time. Google says they're going to release some kind of LLM-based product this year though.

1