Comments (4)
These are very interesting directions to explore, and we definitely see the possibility that ChatArena could evolve in these directions. We completely agree that making the environments more accessible to ordinary users and more enjoyable for human players has huge value. However, due to our limited capacity, our team is currently focused on enriching the set of environments and LLM backends, so that more LLMs can interact with each other in diverse environments. We welcome community contributions that make ChatArena more fun to play with and that develop more intuitive interfaces.
We are going to release a detailed development plan this week. Please join our Slack channel to get the latest updates on ChatArena.
from chatarena.
See GPT-Bargaining for a tournament!
https://github.com/FranxYao/GPT-Bargaining
Basically, we ask GPT, Claude, Cohere, and Jurassic models to bargain with each other and see who can get a better deal.
Our ranking is: GPT-4 > Claude-v1.3 > GPT-3.5-Turbo > Claude-instant-v1.0 > Jurassic > Cohere
We have just finished the paper and will very soon integrate the bargaining game into ChatArena!
Thank you all for your replies. I hope this project will develop.
This is an interesting experiment, and it is great that Claude v1.3 is performing well.
I look forward to seeing it demonstrated in ChatArena.
We've announced our upcoming features and future directions in #36. Feel free to comment and request new features there if the feature you want is not covered.