Files
verl/recipe/langgraph_agent/example/math_expression.py
Joel 7aabfc437b [rollout] feat: add ReactAgentLoop based on LangGraph (#2463)
### What does this PR do?

This is an initial effort to integrate LangGraph into agent loop:
1. add a LangGraph react agent loop implementation
2. add math expression example to demonstrate react agent loop usage.

### Design & Code Changes

New components
- ChatModel: [custom chat
model](https://python.langchain.com/docs/how_to/custom_chat_model/)
using LangChain abstractions, implementing following abstract method:
  - bind_tools:  bind tools to the model
  - _generate:  native async generate chat completion message

- ReactAgentLoop: [LangGraph react
agent](https://langchain-ai.github.io/langgraph/agents/overview/) which
can use tools to perform tasks.

<img width="593" height="467" alt="image"
src="https://github.com/user-attachments/assets/d629b170-03c5-4810-a6b0-4dc27a285c0e"
/>

### Checklist Before Submitting

> [!IMPORTANT]
> Please check all the following items before requesting a review,
otherwise the reviewer might deprioritize this PR for review.

- [ ] Read the [Contribute
Guide](https://github.com/volcengine/verl/blob/main/CONTRIBUTING.md).
- [ ] Apply [pre-commit
checks](https://github.com/volcengine/verl/blob/main/CONTRIBUTING.md#code-linting-and-formatting):
`pre-commit install && pre-commit run --all-files --show-diff-on-failure
--color=always`
- [ ] Add / Update [the
documentation](https://github.com/volcengine/verl/tree/main/docs).
- [ ] Add unit or end-to-end test(s) to [the CI
workflow](https://github.com/volcengine/verl/tree/main/.github/workflows)
to cover all the code. If not feasible, explain why: ...
- [ ] Once your PR is ready for CI, send a message in [the `ci-request`
channel](https://verl-project.slack.com/archives/C091TCESWB1) in [the
`verl` Slack
workspace](https://join.slack.com/t/verl-project/shared_invite/zt-3855yhg8g-CTkqXu~hKojPCmo7k_yXTQ).
2025-07-16 13:41:04 +08:00

40 lines
1.3 KiB
Python

# Copyright 2024 Bytedance Ltd. and/or its affiliates
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
from langchain_core.tools import tool
from recipe.langgraph_agent.react_agent_loop import ReactAgentLoop
@tool(parse_docstring=True)
def calculate(a: int, b: int, operand: str) -> int:
"""
Compute the results using operand with two integers
Args:
a: the first operand
b: the second operand
operand: '+' or '-' or '*' or '@'
"""
assert operand in ["+", "-", "*", "@"], f"unknown operand {operand}"
if operand == "@":
return 3 * a - 2 * b
return eval(f"{a} {operand} {b}")
class MathExpressionReactAgentLoop(ReactAgentLoop):
@classmethod
def init_class(cls, config, tokenizer, **kwargs):
cls.tools = [calculate]
super().init_class(config, tokenizer)