ykpypl Goto Github PK
Type: User
Type: User
[CVPR 2024🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
a state-of-the-art-level open visual language model | 多模态预训练模型
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
deep learning for image processing including classification and object-detection etc.
FastAPI framework, high performance, easy to learn, fast to code, ready for production
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、科知识图谱、清华大学人工智能技术系列报告、自然语言生成、NLU太难了系列、
gpt4all: run open-source LLMs anywhere
this is my first repository
ImageBind One Embedding Space to Bind Them All
Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment.Demo apps to showcase Llama2 for WhatsApp & Messenger
LLaVA-Interactive-Demo
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
Multi-View Transformer for 3D Visual Grounding [CVPR 2022]
Examples and guides for using the OpenAI API
"Video-ChatGPT" is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Utilities for working with videos
VirtualWife是一个虚拟数字人项目,支持B站直播,支持openai、ollama
Datasets, Transforms and Models specific to Computer Vision
Webots Robot Simulator
Code for "Distilling coarse-to-fine semantic matching knowledge for weakly supervised 3D visual grounding" (ICCV 2023)
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.