Abhilash Shankarampeta

19 posts

Abhilash Shankarampeta banner
Abhilash Shankarampeta

Abhilash Shankarampeta

@areddys53

Teleporting to the next Era | Prev - @AmazonScience @meeshotech | UCSD | IITG

San Diego, CA Se unió Kasım 2018
1K Siguiendo71 Seguidores
Abhilash Shankarampeta retuiteado
Yujie Zhao
Yujie Zhao@YujieZhao455906·
We are excited to release AMA-Bench 🎉 Our goal is to evaluate agent memory itself, not just dialogue. Many existing memory benchmarks are still centered on conversation or long context. But real agent memory happens over long horizon agent-environment trajectories, with machine generated observations, evolving states, and causal structure. AMA-Bench includes real world + synthetic settings across multiple domains, and we also introduce AMA-Agent. Paper: huggingface.co/papers/2602.22… Project: ama-bench.github.io Dataset: huggingface.co/datasets/AMA-b… #LLM #AgentMemory #Memory #Agent
Yujie Zhao tweet mediaYujie Zhao tweet mediaYujie Zhao tweet mediaYujie Zhao tweet media
English
3
16
94
7.8K
Abhilash Shankarampeta retuiteado
Hao AI Lab
Hao AI Lab@haoailab·
Seedance-2 and Kling-3 signal that AI video generation is entering a “photorealistic” era, but realism does not guarantee reasoning and scientific correctness. In the classic breaking dry spaghetti experiment, real fractures arise from elastic energy and stress waves, often producing three pieces. Yet models frequently generate physically incorrect snaps. We introduce VideoScience-Bench to evaluate scientific reasoning in video generation. Most models look convincing but lack true scientific understanding, with only weak signals from Kling-3, Sora-2, and Veo-3. Learn details from our blog 👇🧐🧪 hao-ai-lab.github.io/blogs/videosci…
English
1
8
31
5.3K
Varun Yerram
Varun Yerram@varunyer·
⭐Soo happy to share that I'll be joining @NYUDataScience as a PhD student this Fall! Excited to learn from and work with @eunsolc, @hhexiy, and the amazing folks at @CILVRatNYU. Looking forward to better understanding and improving large ML models.
English
12
2
104
6.4K
Abhilash Shankarampeta
Abhilash Shankarampeta@areddys53·
So excited to share this work #TRANSIENTTABLES 🎉 Addressing temporal reasoning gaps in LLMs is crucial. Honored to present our work on LLM temporal reasoning at @naaclmeeting. See you at #NAACL2025!
Vivek Gupta@keviv9

#NAACL2025 Paper 1/6 🚀 Thrilled to announce our paper, "TRANSIENTTABLES: Evaluating LLMs' Reasoning on Temporally Evolving Semi-structured Tables," has been accepted for an oral presentation at #TRANSIENTTABLES #NAACL2025! 🎉 We explore how LLMs handle information that changes over time in tables. Check it out: transienttables.github.io

English
2
0
10
367
Abhilash Shankarampeta retuiteado
Mubashara Akhtar
Mubashara Akhtar@akhtarmubashara·
I am in Singapore for #EMNLP2023. Check out poster our “Exploring the Numerical Reasoning Capabilities of Language Models: A Comprehensive Analysis on Tabular Data“. We know SOTA LLMs struggle with numbers, but where exactly does the challenge lie? 🧵 arxiv.org/pdf/2311.02216…
Mubashara Akhtar tweet media
English
1
2
27
2.4K
Abhilash Shankarampeta retuiteado
Vivek Gupta
Vivek Gupta@keviv9·
🌟 News: I'm on the academic market! 🌟 🔍 Seeking: Tenure-Track Faculty positions in Computer Science. I work on Natural language processing systems. 💡 Expertise: Elevating reasoning and inference capabilities in semi-structured tabular data. #NLProc #AI #AcademicTwitter
Vivek Gupta tweet media
English
2
25
122
19.1K
Abhilash Shankarampeta
Abhilash Shankarampeta@areddys53·
Excited to share that our paper is accepted at #EMNLP2023 🎉. In this work we evaluated the language models across various numerical reasoning tasks on tabular data. It was pleasure working with @akhtarmubashara, @keviv9 , Arpit
Vivek Gupta@keviv9

2️⃣ Tabular Numerical Reasoning Evaluation (finding), led by @akhtarmubashara (first-author), @areddys53 (co-first author), Arpit, and mentoring by Oana, @esimperl. Three time-zone work experience. Thanks to @kclinformatics, @IITGuwahati, @cogcomp, @upennnlp. 🙌🎉 for support-2/2.

English
0
0
8
523
Abhilash Shankarampeta retuiteado
Vivek Gupta
Vivek Gupta@keviv9·
4. Enhancing Tabular Reasoning with Pattern Exploiting Training @suki_2022 (Non-Archival), we use the PET (AdaPET) pre-training i.e. MLM objective jointly train with tabular inference task for entity style tabular data. Joint work with @areddys53. -8/n
English
1
2
6
0
Parth Shah
Parth Shah@parthsh_·
Trying out gradient by Paperspace today, to reduce my dependence on Colab. Pros - No timeouts, A Jupyter lab environment that can be modified and is more responsive than Colab. Cons - 2 CPUs of 2 Gb only, the promised free GPU is also not always available. What do you guys use?
English
1
0
4
0
Abhilash Shankarampeta retuiteado
IITG.ai
IITG.ai@iitgai·
The Animal-AI Olympics is an AI competition with tests inspired by animal cognition. The Animal-AI environment is a new AI experimentation and evaluation platform that implements ideas from animal cognition in order to better train AI agents that possess cognitive skills.
GIF
English
1
1
5
0
Abhilash Shankarampeta retuiteado
IITG.ai
IITG.ai@iitgai·
#Fake_News_Detection Identifying fake news is one of the most challenging and open ended tasks of AI today! We have seen the devastating effects of Fake News especially nowadays in the case of the Covid-19 pandemic..
English
3
1
4
0