
@chamath You get your model to summarize the session then you chunk, embedd and load to a database (Azure Sql, Cosmos or Postgresql) as Vector data type. You then add a DiskANN index and for each new session you can pull the last x records and also the top 20 similar records.
English
























