Transformers, introduced in 2017, are a pivotal architecture for LLMs, designed around the concept of attention, which helps process longer text sequences efficiently
developers.google.com/machine-learni…#LLM#NLP
LLMs enhance conversational AI: Large Language Models (LLMs) have significantly improved the quality and scalability of conversational AI applications across various industries and use cases.
#LLM#aigraphtowardsdatascience.com/redefining-con…
ChatGPT can now browse the internet to provide you with current and authoritative information, complete with direct links to sources. It is no longer limited to data before September 2021.
it does a great job of seeking areas in the data that have a high density of observations, versus areas of the data that are not very dense with observations. DBSCAN can sort data into clusters of varying shapes as well
elutins.medium.com/dbscan-what-is…#DBSCAN#aigraph
BERT’s secret sauce lies in its ability to understand the bidirectional context.
@shaikhrayyan123/a-comprehensive-guide-to-understanding-bert-from-beginners-to-advanced-2379699e2b51" target="_blank" rel="nofollow noopener">medium.com/@shaikhrayyan1…
@zaiinn440/a-comparative-analysis-of-llms-like-bert-bart-and-t5-a4a873251ff" target="_blank" rel="nofollow noopener">medium.com/@zaiinn440/a-c…
In the case of BERT, the model is trained to predict one word for the corresponding mask. But T5 is hybrid. It is trained to output one word or multiple words for one mask
Euclidean distance is sensitive to the magnitudes of the vectors, whereas cosine similarity is not.
@swansburg.justin/how-to-use-llms-to-build-better-clustering-models-9b17a5491bb4" target="_blank" rel="nofollow noopener">medium.com/@swansburg.jus…
Imagine islands on the ocean, where the sea level is the threshold and the different islands are your clusters. The land below the sea level is noise. As the sea level goes down, new islands appear some islands combine to form bigger island
#aigraphtowardsdatascience.com/a-gentle-intro…
the proposed HAC ultilize the information of the algorithm, reduce the unnecessary computation.
Used unsupervised classification accuracy (confusion matrix) to measure the performance. It' s quite intuitive and reasonable.
#aigrapharxiv.org/abs/1301.7401
Use grant-to-article to validate the unsupervised task. It might be a potential metric that we can apply on our clustering. #aigraphjournals.plos.org/plosone/articl…