Giter VIP home page Giter VIP logo

dataselection_ondevicelearning's Introduction

Enabling On-Device Large Language Model Personalization with Self-Supervised Data Selection and Synthesis

This repository contains the implementation and resources for the paper "Enabling On-Device Large Language Model Personalization with Self-Supervised Data Selection and Synthesis," accepted at the Design Automation Conference (DAC) 2024. Our work presents a novel framework for personalizing large language models (LLMs) on-device through self-supervised data selection and synthesis techniques. This approach enables efficient and effective personalization without compromising user privacy or requiring extensive computational resources.

Paper Abstract

With the increasing deployment of large language models (LLMs) in on-device applications, personalizing these models to better reflect individual user preferences and contexts has become crucial. However, traditional personalization methods often rely on extensive data collection and processing, raising privacy and efficiency concerns. We propose a self-supervised framework that leverages user interaction data to selectively synthesize personalized training datasets. This method significantly enhances the personalization effectiveness of LLMs while operating within the constraints of on-device computing resources. Our experiments demonstrate notable improvements in model performance across various tasks, showcasing the potential of our approach for on-device LLM personalization.

Citation

If you find our work useful for your research or if you use parts of this code in your own projects, please consider citing our paper:

@article{qin2023enabling,
  title={Enabling On-Device Large Language Model Personalization with Self-Supervised Data Selection and Synthesis},
  author={Qin, Ruiyang and Xia, Jun and Jia, Zhenge and Jiang, Meng and Abbasi, Ahmed and Zhou, Peipei and Hu, Jingtong and Shi, Yiyu},
  journal={arXiv preprint arXiv:2311.12275},
  year={2023}
}

dataselection_ondevicelearning's People

Contributors

ruiyangqin2016 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.