Did you use prompt like <a href="https://github.com/hwchase17/langchain/blob/bc2ed93b7


How to ask an LLM to generate ReAct format?

linonetwo commented on July 16, 2024

How to ask an LLM to generate ReAct format?

from react.

Comments (5)

timothylimyl commented on July 16, 2024

I don't think collecting the fine-tuning data in this paper needs to take very long: the author just mentions that they prompted larger language models for the data, so it is basically knowledge distillation.
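To make the distillation idea concrete, here is a rough sketch of turning trajectories produced by a larger model into fine-tuning records. The record fields (`question`, `trajectory`, `correct`), the `prompt`/`completion` output schema, and the choice to keep only correct trajectories are all assumptions for illustration, not the paper's actual pipeline.

```python
import json

def to_finetune_records(trajectories):
    """Turn collected trajectories into prompt/completion pairs.

    Assumption: each item is a dict with hypothetical keys
    'question', 'trajectory' and 'correct'.
    """
    records = []
    for item in trajectories:
        # Assumption: drop trajectories whose final answer was wrong.
        if not item["correct"]:
            continue
        records.append({
            "prompt": f"Question: {item['question']}\n",
            "completion": item["trajectory"],
        })
    return records

def to_jsonl(records):
    """Serialize records as JSON Lines, one record per line."""
    return "\n".join(json.dumps(r) for r in records)

# Made-up sample data standing in for output from a larger model.
sample = [
    {
        "question": "Who wrote the novel Dune?",
        "trajectory": "Thought 1: I should search for Dune.\n"
                      "Action 1: Search[Dune (novel)]\n"
                      "Observation 1: Dune is a 1965 novel by Frank Herbert.\n"
                      "Thought 2: The author is Frank Herbert.\n"
                      "Action 2: Finish[Frank Herbert]",
        "correct": True,
    },
    {
        "question": "Who wrote the novel Dune?",
        "trajectory": "Thought 1: I think I already know.\n"
                      "Action 1: Finish[Isaac Asimov]",
        "correct": False,
    },
]

records = to_finetune_records(sample)
jsonl_text = to_jsonl(records)
print(jsonl_text)
```

The manual work is then mostly limited to checking which trajectories ended with the right answer, since the larger model writes the reasoning steps.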


timothylimyl commented on July 16, 2024

In the langchain example, they prepend the [EXAMPLES], which are examples of how to follow the ReAct framework. This is basically few-shot learning through the prompt: it is purely prompt engineering and does not touch the weights of the model.

The method is correct. You can also use the examples to fine-tune the LLM if you have the resources (data + compute) and want better results, as the author of the paper shows on a few different datasets.
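As a minimal sketch of the few-shot approach, the prompt can be assembled by prepending worked examples to the new question. The Thought/Action/Observation layout follows the ReAct format; the example content and the `build_react_prompt` helper are made up for illustration and are not taken from langchain or the paper's repo.

```python
# Hypothetical few-shot example written in the ReAct layout.
EXAMPLES = [
    "Question: What is the capital of the country where the Eiffel Tower is located?\n"
    "Thought 1: I need to find where the Eiffel Tower is located.\n"
    "Action 1: Search[Eiffel Tower]\n"
    "Observation 1: The Eiffel Tower is a tower in Paris, France.\n"
    "Thought 2: The country is France, and its capital is Paris.\n"
    "Action 2: Finish[Paris]",
]

def build_react_prompt(question: str, examples: list[str]) -> str:
    """Prepend the examples so the model continues in the same format."""
    shots = "\n\n".join(examples)
    # End with an open 'Thought 1:' so the completion starts with reasoning.
    return f"{shots}\n\nQuestion: {question}\nThought 1:"

prompt = build_react_prompt("Who wrote the novel Dune?", EXAMPLES)
print(prompt)
```

The resulting string is sent to the model as-is. In practice you would stop generation at the next "Observation" line, run the requested action yourself, append the real observation, and call the model again.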


linonetwo commented on July 16, 2024

Thank you for the confirmation!

So in your paper you are fine-tuning, which produces better output but needs a long manual data preparation period. And while fine-tuning saves some tokens when calling the API, it also increases the cost of each API call. So each approach has pros and cons.

I will start with few-shot prompt engineering and collect data for fine-tuning along the way.


ysymyth commented on July 16, 2024

Hi @linonetwo, is there a follow-up question?


linonetwo commented on July 16, 2024

I was confused about why the model could do this. But I have read more material recently, and I now know that even OpenAI doesn't know why this ability emerges.

