Jay van Zyl

3.2K posts

@jayvanzyl

Interested in real-time predictions and experimentation https://t.co/yz8MMNzslJ

Palo Alto, CA · Joined May 2008
830 Following · 1K Followers
Jay van Zyl @jayvanzyl
Important factors to consider wrt cost of model training and serving: “SOTA models these days have about ~500B parameters and that represents at least ~1TB of GPU memory to operate with specialized infrastructure. That's a minimum of ~$60,000 - $100,000 p… lnkd.in/gKRDCbfa
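A rough back-of-the-envelope check of the "~500B parameters, ~1TB of GPU memory" figure quoted above. This is a minimal sketch, assuming 16-bit weights (2 bytes per parameter), which the quote does not state; the function name `weight_memory_bytes` is hypothetical, and the estimate covers weights only, not activations or KV cache.

```python
def weight_memory_bytes(num_params: int, bytes_per_param: int = 2) -> int:
    """Bytes needed to hold the weights alone (no activations, no KV cache)."""
    return num_params * bytes_per_param


params = 500 * 10**9  # ~500B parameters, as in the quoted post

# Assumption: 2 bytes per parameter (16-bit weights).
fp16_tb = weight_memory_bytes(params) / 10**12
# For comparison: 4 bytes per parameter (32-bit weights).
fp32_tb = weight_memory_bytes(params, bytes_per_param=4) / 10**12

print(f"16-bit weights: {fp16_tb:.1f} TB")
print(f"32-bit weights: {fp32_tb:.1f} TB")
```

Under the 16-bit assumption the weights alone come to about 1 TB, which lines up with the quoted "at least ~1TB"; serving overhead would push the real requirement higher.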
Jay van Zyl @jayvanzyl
StableLM is trained on a new experimental dataset built on The Pile, but three times larger with 1.5 trillion tokens of content. The richness of this dataset gives StableLM surprisingly high performance in conversational and coding… lnkd.in/g3UA_jPw lnkd.in/gQ47SQU7
Jay van Zyl @jayvanzyl
The analogy between the syntax-semantics of natural languages and the sequence-function of proteins has revolutionized the way humans investigate the language of life. lnkd.in/g_wnAatf
Jay van Zyl @jayvanzyl
With YouTube creators becoming increasingly empowered by versatile generative AI tools, it will only amplify the rising trend of audiences consuming more user-generated content on TVs, conducive to more YouTube advertising revenue,… lnkd.in/g-ZSaH7p lnkd.in/g6xjWzgM
Jay van Zyl @jayvanzyl
They say a good craftsman shouldn't blame his tools, but can a good tool [LLM] blame a shoddy craftsman? Large language models specialize in generating human-like text. Correct answers are a bonus. lnkd.in/gVXfvhSE
Jay van Zyl @jayvanzyl
Another key concept to understand: most AI-generated images produced today rely on diffusion models as their foundation. lnkd.in/gxdM5sAJ
Jay van Zyl @jayvanzyl
Together with ecosystem.Ai real-time behavioral capabilities, generative models add a much-needed angle to AI for business usefulness. Here is another outline in summary for those who need a quick reference: Generativ… lnkd.in/g7pGgkep lnkd.in/gxkQY9KW
Jay van Zyl @jayvanzyl
Cape Town looks like a safe option while we're working on solving all of this :) lnkd.in/gMYRVb_M
Jay van Zyl @jayvanzyl
Excellent share @dxbrob. "It is perhaps uncontroversial to say that this claim that one of us made eight years ago (Soman, 2015) is now accepted as universal truth. Governments, for-profit organizations, not for profits, startups, consumer protect… lnkd.in/gGprFd8y
Jay van Zyl @jayvanzyl
FinGPT emphasizes the critical significance of data collection, cleaning, and preprocessing in creating open-source FinLLMs using a data-centric approach. FinGPT seeks to advance financial research, cooperation, and innovation by p… lnkd.in/gcDUmgy7 lnkd.in/g69ivMnZ
Jay van Zyl @jayvanzyl
Great paper on transformers: “Transformer large language models (LLMs) have sparked admiration for their exceptional performance on tasks that demand intricate multi-step reasoning. Yet, these models simultaneously show failures on… lnkd.in/gsxXReqV lnkd.in/gwxwhsaN
Jay van Zyl @jayvanzyl
Gorilla is a major addition to the list of language models, as it specifically addresses the problem of writing API calls. Its capabilities help reduce problems related to hallucination and reliability. lnkd.in/g7g_qd-E
Jay van Zyl @jayvanzyl
Another great set of models. Why use Falcon-40B? 1. It is the best open-source model currently available. Falcon-40B outperforms LLaMA, StableLM, RedPajama, MPT, etc. See the OpenLLM Leaderboard. 2. It features an architecture optimized for inference, wit… lnkd.in/gR-sq7cK