Rowel Atienza ๐ต๐ญ
102 posts

Rowel Atienza ๐ต๐ญ
@jacobe
Creator of ViTSTR and EfficientSpeech (ICASSP2023) and co-creator of PARSeq. Professor & Scientist at the University of the Philippines.
University of the Philippines ๊ฐ์
์ผ ลubat 2007
1K ํ๋ก์338 ํ๋ก์

Most ARM chips can't run decent AI models. Introducing EfficientSpeech, a 266k-param TTS model. Low cost ARM chips like in RPi4 can generate 104sec of speech mel spec in 1sec. Here's an AI-generated video w/ voice from EfficientSpeech. Info: github.com/roatienza/effiโฆ #ICASSP2023
English

@arXiv_Daily Simple yet effective idea: Remove inefficient top-most layers & replace them with an efficient head. For VWW, param count reduced by 93% with only 0.65% accuracy decrease. Counterintuitively, the quantized pruned net increased its accuracy on ARM Cortex M0.
English
Rowel Atienza ๐ต๐ญ ๋ฆฌํธ์ํจ

Depth Pruning with Auxiliary Networks for TinyML
deepai.org/publication/deโฆ
by Josen Daniel De Leon and @jacobe
#NeuralNetwork #ComputerScience
English

@ak92501 @Gradio @huggingface Thanks. Let's keep on building better infrastructure and tooling to make AI more accessible.
English
Rowel Atienza ๐ต๐ญ ๋ฆฌํธ์ํจ

"The UP National Engineering Center Analytics and Data Science Certifications announced the development on Wednesday." gmanetwork.com/news/scitech/tโฆ via @gmanews
English

Idea: If data augmentation improves model generalization, why not use it to generate 2 new inputs and force the representations to agree. Result: Additional model performance improvement. Comparison: Unlike Label Smoothing, the performance of our method, AgMax, is consistent.
DeepAI@DeepAI
Improving Model Generalization by Agreement of Learned Representations from Data Augmentation deepai.org/publication/imโฆ by @jacobe #ComputerVision #ImageNet
English
Rowel Atienza ๐ต๐ญ ๋ฆฌํธ์ํจ

I am excited to share my latest work: 8-bit optimizers โ a replacement for regular optimizers. Faster ๐, 75% less memory ๐ชถ, same performance๐, no hyperparam tuning needed ๐ข. ๐งต/n
Paper: arxiv.org/abs/2110.02861
Library: github.com/facebookresearโฆ
Video: youtube.com/watch?v=IxrlHAโฆ

YouTube

English
Rowel Atienza ๐ต๐ญ ๋ฆฌํธ์ํจ

Weโre introducing GSLM, the first language model that breaks free completely of the dependence on text for training. This โtextless NLPโ approach learns to generate expressive speech using only raw audio recordings as input. Learn more and get the code:
ai.facebook.com/blog/textless-โฆ

English

@arXiv_Daily Data Augmentation for STR will be presented at ICCV 2021 Workshop on Interactive Labeling and Data Augmentation for Vision. GitHub: github.com/roatienza/straโฆ
English
Rowel Atienza ๐ต๐ญ ๋ฆฌํธ์ํจ

Data Augmentation for Scene Text Recognition
deepai.org/publication/daโฆ
by @jacobe
#ComputerVision #ComputerVision
English
Rowel Atienza ๐ต๐ญ ๋ฆฌํธ์ํจ

ODSC APAC Virtual Conference Speaker: hubs.ly/H0V892b0 #ODSCAPAC #DataScience #DeepLearning @jacobe @upsystem

English


@arXiv_Daily It took us more than a year building, collecting, annotating, validating and benchmarking this dataset.
Dataset: github.com/upeee/GOO-GAZEโฆ
To appear at #CVPR2021 Workshop: gazeworkshop.github.io/2021/
English
Rowel Atienza ๐ต๐ญ ๋ฆฌํธ์ํจ

GOO: A Dataset for Gaze Object Prediction in Retail Environments
deepai.org/publication/goโฆ
by Henri Tomas et al. including @jacobe
#Estimator #Statistics
English

@Deep__AI Vision Transformer (ViT) for reading real-world text images. My paper will be presented at #ICDAR2021 icdar2021.org.
English
Rowel Atienza ๐ต๐ญ ๋ฆฌํธ์ํจ

Vision Transformer for Fast and Efficient Scene Text Recognition
deepai.org/publication/viโฆ
by @jacobe
#ComputerVision #PatternRecognition
English

Yesterday, my former grad student Daryl gave a talk at Sony CSL Paris about his thesis on Next View Policy for 3D Reconstruction. Youtube: youtu.be/KdyDj3bjU0I

YouTube
English



