username

Aditya Chinchure

Multimodal Learning @ UBC | Photography

Publications

VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge

VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge

Sahithya Ravi*, Aditya Chinchure*, Leonid Sigal, Renjie Liao, Vered Schwartz (*equal contribution)

Oct 23, 2022

We present a new Vision-Language-Commonsense transformer model, VLC-BERT, that incorporates contextualized knowledge using Commonsense Transformer (COMET) to solve Visual Question Answering (VQA) tasks that require commonsense reasoning.

WACV 2023

Academic Projects

DE-TensoRF: Data-efficient and fast NeRFs

DE-TensoRF: Data-efficient and fast NeRFs

Apr 28, 2023

Developed DE-TensoRF, a model that can render 3D objects with as few as 3 images, and in under 15 min on a single GPU. We achieved the highest grade in our class, and led to collaboration efforts with Dr. Helge Rhodin’s research group.

A Summary of Recent Text Summarization Techniques

A Summary of Recent Text Summarization Techniques

Dec 03, 2020

In this project paper, we surveyed text summarization models by evaluating existing extractive and abstractive models. We studied the metrics and datasets used to evaluate the latest models and evaluated upcoming abstractive techniques. Finally, we highlighted future pathways for text summarization and suggested areas for improvement

Other Projects