I'm Diganta Misra, founder of a research group Landskape, and a Research MSc in CS (Machine Learning specialization) at MILA, Montreal affiliated with UdeM supervised by Professor Irina Rish. I mostly focus on Generative Modeling, Continual Learning, LLMs for Code Generation and Sample Complexity.
News & Updates: (Click to expand)
- June 2024: Our work on Slight Corruption in Pre-training Data Makes Better Diffusion Models is out now on arXiv.
- April 2024: Our work on Uncovering the Hidden Cost of Model Compression is accepted at the Prompting in Vision workshop, CVPR, 2024.
- April 2024: Our work on On the low-shot transferability of [V]-Mamba is accepted at the Prompting in Vision workshop, CVPR, 2024.
- April 2024: Our work on Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order is now out on arXiv.
- March 2024: Our work on Shapley Interactions for Complex Feature Attribution is out now on arXiv.
- March 2024: Our new preprint Just Say the Name: Online Continual Learning with Category Names Only via Data Generation is now out on arXiv.
- March 2024: Our new preprint On the low-shot transferability of [V]-Mamba is now out on arXiv.
- March 2024: I will be starting as an ELLIS PhD Fellow at the Max Planck Institute for Intelligent Systems (MPI-IS) under the supervision of Antonio Orvieto.
- March 2024: Our work on $\mathcal{D}^2$-Sparse: Navigating the low data learning regime with sparse networks is accepted (Oral) at the 5th PML4LRS workshop,ICLR, 2024.
- March 2024: Our work on GitChameleon: Breaking the version barrier for code generation models is accepted to 4th DMLR workshop,ICLR, 2024.
- October 2023: Our work on "Mitigating Mode Collapse in Sparse Mixture of Experts" is accepted to New in ML workshop, NeurIPS, 2023.
- October 2023: Our work on "Shapley Interactions for Complex Feature Attribution" is accepted to NeurIPS ATTRIB workshop, 2023.
- August 2023: Our new preprint on Reprogramming under constraints is now out on ArXiv.
- August 2023: Gave an invited talk at TU Eindoven on Learning under constraints.
- June 2023: I will be joining HSL, CMU in Fall 2023 as a Visiting Researcher.
- May 2023: Our work on Challenging Common Assumptions about Catastrophic Forgetting got accepted to CoLLAs, 2023.
- May 2023: Gave an invited talk at VITA, UT-Austin on Multi-Domain Expert Layers.
- April 2023: Our work on Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models got accepted to TMLR.
- March 2023: Our work on Pruning CodeBERT for Improved Code-to-Text Efficiency is accepted to the Sparsity in Neural Network (SNN) workshop @ ICLR, 2023.
- November 2022: Gave a talk titled Modality agnostic adaptation in deep learning at the IBM Generalisation talk series.
- November 2022: Our work on APP: Anytime Progressive Pruning is accepted to the SlowDNN workshop, 2023.
- November 2022: Our work on APP: Anytime Progressive Pruning is accepted to the Continual Lifelong Learning (CLL) workshop at ACML, 2022.
- July 2022: Our work on APP: Anytime Progressive Pruning is accepted to the Sparsity in Neural Network (SNN) workshop, 2022.
- June 2022: Our work on Scaling the Number of Tasks in Continual Learning got accepted to the CoLLAs 2022 workshop.
- June 2022: Our work on APP: Anytime Progressive Pruning is accepted to the Dynamic Neural Network (DyNN) workshop at ICML, 2022.
- May 2022: Awarded the MILA Entrepreneurs Grant worth CAD$5,000.
- May 2022: Awarded the AI Week 2022 Student Travel Bursary worth CAD$1,500.
- April 2022: Awarded the UNIQUE AI Excellence Scholarship worth C$10,000.
- April 2022: The preprint of our paper APP: Anytime Progressive Pruning is out now.
- April 2022: I am starting as a researcher at Morgan Stanley.
- March 2022: Awarded the DIRO x Quebec Ministry of Higher Education international students scholarship worth C$4000.
- February 2022: I will be serving as a Program Committee member for Conference on Lifelong Learning Agents(CoLLA) 2022.
- January 2022: I am selected to be a part of the MILA Winter 2022 Entrepreneurs Cohort.
- December 2021: I will be serving as a teaching assistant for the INF8225: Probabilistic Learning at Polytechnique University taught by Christopher J. Pal for the Winter 2022 semester.
- August 2021: Our fine grained tense modification task was accepted to Google's Big Bench.
- July 2021: I am also joining the VITA, UT-Austin as a Visiting Research Scholar to work on sparsity under the guidance of Assistant Professor Zhangyang Wang.
- May 2021: We are organizing the Spring Edition of the Weights & Biases ML Reproducibility Challenge. Visit our page to learn more.
- May 2021: I will be joining MILA as a graduate student this fall '21 under the supervision of Professor Irina Rish.
- January 2021: Our WACV paper's video is now out on YouTube. Watch it here.
- January 2021: I will be speaking at the W&B Deep Learning Salon on "From Smooth Activations to Robustness to Catastrophic Forgetting". I will be joined by Maithra Raghu from Google Brain. Watch it here.
- December 2020: I'm starting full time as a Machine Learning Engineer at Weights & Biases.
- October 2020: Our paper Rotate to Attend: Convolutional Triplet Attention Module is accepted to WACV 2021.
- September 2020: Gave a talk on my paper on Mish at the Robert Bosch Bangalore Research Office.
- August 2020: I completed my Undegraduate degree in Electronics and Electrical Engineering from Kalinga Institute of Industrial Technology (KIIT).
- August 2020: Gave a talk on Mish and Non-Linear Dynamics at Computer Vision Talks. Watch here.
- July 2020: My paper Mish: A Self Regularized Non-Monotonic Neural Activation Function is accepted at BMVC 2020.
- July 2020: CROWN: A comparison of morphology for Mish, Swish and ReLU produced in collaboration with Javier Ideami. Watch here.
- May 2020: Participated in an AMA for my paper on Mish at the Weights & Biases reading group.
- April 2020: Presented my views and discussed about Data Science on the The World is Ending Podcast. Listen to the episode here.
- February 2020: Talk on Mish and Non-Linear Dynamics at Sicara is out now. Watch here.
- February 2020: Podcast episode on Mish at Machine Learning Café is out now. Listen here.
- November 2019: Presented a talk on my paper on Mish at the University of Athens.
For more updates, please visit my personal webpage.