I need to have a little lie down after reading about Taalas, the startup that claims to offer a 1000x speedup in LLM inferencing by building dedicated ASICs per model.
I need to have a little lie down after reading about Taalas, the startup that claims to offer a 1000x speedup in LLM inferencing by building dedicated ASICs per model. Well done lads, you’ve invented what? A LUT? A ROM? They claim to be able to churn new models out in two months, but their first prototype runs a two-year-old Llama model badly quantised down to 3 bits. I’m about 90% certain there will be a Theranos outcome for this company.