openai / clip-featurevis
code for reproducing some of the diagrams in the paper "Multimodal Neurons in Artificial Neural Networks"
Can someone make a colab notebook for this?
In the model file there is a self.graph_def referenced on line 38 but I can't seem to see where it is defined.
Dear OpenAI team,
Thank you for sharing this great implementation, I really like it.
In the section 'Understanding Language', you mention:
If we fix a neuron on the vision side, we can search for the text that maximizes the logit. We do this with a hill climbing algorithm to find what amounts to the text maximally corresponding to that neuron.
Would you mind specifying how you do this? Do you use a dataset of sentences, or do you optimize the sentences using a pretrained language model? The vision model only connects to the language model at the output, so how do you get a gradient to optimize against? How do you do text feature visualization?
Thank you for your help.
Best Wishes,
Alex
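For readers with the same question: the quoted passage does not spell out the procedure, so the following is only a minimal sketch of what a gradient-free hill climb over text could look like, not the authors' method. It assumes a black-box `score` function; in a real setup that would be the logit between the encoded text and the chosen vision-side neuron, but here `toy_score`, `VOCAB`, and `hill_climb_text` are hypothetical stand-ins so the example runs on its own. Because the search only compares scores of candidate token sequences, no gradient through the text side is needed.

```python
import random
from typing import Callable, List, Sequence, Tuple


def hill_climb_text(
    score: Callable[[Sequence[str]], float],
    vocab: Sequence[str],
    length: int = 4,
    iterations: int = 200,
    seed: int = 0,
) -> Tuple[List[str], float]:
    """Greedy hill climbing over token sequences using a black-box score."""
    rng = random.Random(seed)
    # Start from a random sequence of tokens.
    tokens = [rng.choice(vocab) for _ in range(length)]
    best = score(tokens)
    for _ in range(iterations):
        # Propose a single-token substitution at a random position.
        pos = rng.randrange(length)
        candidate = list(tokens)
        candidate[pos] = rng.choice(vocab)
        candidate_score = score(candidate)
        # Keep the proposal only if it strictly improves the score.
        if candidate_score > best:
            tokens, best = candidate, candidate_score
    return tokens, best


# Hypothetical stand-in for "logit between this text and a fixed vision neuron".
# A real scorer would encode the text and compare it with the neuron's direction.
def toy_score(tokens: Sequence[str]) -> float:
    target = {"dog", "puppy", "bark"}
    return float(sum(1 for t in tokens if t in target))


VOCAB = ["dog", "puppy", "bark", "car", "tree", "sky", "red", "run"]

if __name__ == "__main__":
    text, value = hill_climb_text(toy_score, VOCAB)
    print(" ".join(text), value)
```

Whether the original search used a fixed vocabulary, single-token substitutions, or some other proposal scheme is not stated in the quoted text; those choices are assumptions made here for illustration.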
Imagen's key discovery is that generic large language models (e.g. T5), pretrained on text-only corpora, are surprisingly effective at encoding text for image synthesis: increasing the size of the language model in Imagen boosts both sample fidelity and image-text alignment much more than increasing the size of the image diffusion model.
They also find that while T5-XXL and CLIP text encoders perform similarly on simple benchmarks such as MS-COCO, human evaluators prefer T5-XXL encoders over CLIP text encoders in both image-text alignment and image fidelity on DrawBench, a set of challenging and compositional prompts.
So how about trying to visualize the features of T5?
T5: https://github.com/google-research/text-to-text-transfer-transformer