7 Commits

Author SHA1 Message Date
H
0f5ab5c854 [doc] feat: add retool blog (#2761)
### What does this PR do?

add link to the retool blog

### Checklist Before Starting

- [ ] Search for similar PRs. Paste at least one query link here: ...
- [ ] Format the PR title as `[{modules}] {type}: {description}` (This
will be checked by the CI)
- `{modules}` include `fsdp`, `megatron`, `sglang`, `vllm`, `rollout`,
`trainer`, `ci`, `training_utils`, `recipe`, `hardware`, `deployment`,
`ray`, `worker`, `single_controller`, `misc`, `perf`, `model`, `algo`,
`env`, `tool`, `ckpt`, `doc`, `data`
- If this PR involves multiple modules, separate them with `,` like
`[megatron, fsdp, doc]`
  - `{type}` is in `feat`, `fix`, `refactor`, `chore`, `test`
- If this PR breaks any API (CLI arguments, config, function signature,
etc.), add `[BREAKING]` to the beginning of the title.
  - Example: `[BREAKING][fsdp, megatron] feat: dynamic batching`
2025-07-26 13:13:55 +08:00
H
83cb13ad53 [recipe, doc] fix: fix dapo branch name (#2090)
### Checklist Before Starting

- [x] Searched for similar PR(s).
- [x] Checked PR Title format
  - In format of: [modules] type: Title
- modules are in `fsdp, megatron, sglang, vllm, rollout, trainer, ci,
training_utils, recipe, hardware, deployment, ray, worker,
single_controller, misc, perf, model, algo, env, tool, ckpt, doc, data`
  - type is in `feat, fix, refactor, chore, test`
- can involve multiple modules, seperated by `,` or space, like
`[megatron, fsdp, doc] feat: xxx`

### What does this PR do?

As title
2025-06-19 09:35:05 +08:00
H
897619d738 [tests] chore: add PR title check (#1901)
### Checklist Before Starting

- [ ] Search for similar PR(s).

### What does this PR do?

> Add one-line overview of what this PR aims to achieve or accomplish. 

### High-Level Design

> Demonstrate the high-level design if this PR is complex.

### Specific Changes

> List the specific changes.

### API

> Demonstrate how the API changes if any.

### Usage Example

> Provide usage example(s) for easier usage.

```python
# Add code snippet or script demonstrating how to use this 
```

### Test

> For changes that can not be tested by CI (e.g., algorithm
implementation, new model support), validate by experiment(s) and show
results like training curve plots, evaluatuion results, etc.

### Additional Info.

- **Issue Number**: Fixes issue # or discussion # if any.
- **Training**: [Note which backend this PR will affect: FSDP, Megatron,
both, or none]
- **Inference**: [Note which backend this PR will affect: vLLM, SGLang,
both, or none]

### Checklist Before Submitting

- [ ] Read the [Contribute
Guide](https://github.com/volcengine/verl?tab=readme-ov-file#contribution-guide).
- [ ] Apply [pre-commit
checks](https://github.com/volcengine/verl?tab=readme-ov-file#code-linting-and-formatting).
- [ ] Add `[BREAKING]` to the PR title if it breaks any API.
- [ ] Update the documentation about your changes in the
[docs](https://github.com/volcengine/verl/tree/main/docs).
- [ ] New CI unit test(s) are added to cover the code path.
- [ ] Rely on existing unit tests on CI that covers the code path.
2025-06-07 18:08:14 +08:00
45aec859d6 Fixed URL for ProRL in README.md (#1866)
Fixed URL for ProRL in README.md
2025-06-05 22:43:52 +08:00
2fe47f71ab Add ProRL to README.md (#1855)
ProRL is a novel training methodology that incorporates KL divergence
control, reference policy resetting, and a diverse suite of tasks. The
empirical analysis reveals that RL-trained models consistently
outperform base models across a wide range of pass@k evaluations,
including scenarios where base models fail entirely regardless of the
number of attempts.

It is developed based on Verl. 

Link: https://arxiv.org/abs/2505.24864
2025-06-05 17:51:11 +08:00
877e097f74 README: add back DeepRetrieval and add a new work s3 (#1592)
### Checklist Before Starting

- [ ] Search for similar PR(s).

### What does this PR do?

> (1) Add back DeepRetrieval (the **first** search agent framework
interacting with search engine) to the "awesome work" of main page, and
(2) add a new work s3 (much more efficient way (70x less data) to train
an powerful search agent!)

### High-Level Design

> Only updates two readme files.

### Specific Changes

> (1) Added "- [DeepRetrieval](https://github.com/pat-jj/DeepRetrieval):
RL Training of **Search Agent** with **Search/Retrieval Outcome**
![GitHub Repo
stars](https://img.shields.io/github/stars/pat-jj/DeepRetrieval)" to the
main page's README.md. (2) Added "- [s3](https://github.com/pat-jj/s3)
**Efficient Yet Effective** Search Agent Training via RL ![GitHub Repo
stars](https://img.shields.io/github/stars/pat-jj/s3)" to the
recipe/README.md

### API

> N/A

### Usage Example

> N/A

### Test

> N/A

### Additional Info.

N/A

### Checklist Before Submitting

- [] Read the [Contribute
Guide](https://github.com/volcengine/verl?tab=readme-ov-file#contribution-guide).
- [] Apply [pre-commit
checks](https://github.com/volcengine/verl?tab=readme-ov-file#code-linting-and-formatting).
- [N/A] Add `[BREAKING]` to the PR title if it breaks any API.
- [N/A] Update the documentation about your changes in the
[docs](https://github.com/volcengine/verl/tree/main/docs).
- [N/A] Add CI test(s) if necessary.
2025-05-19 16:28:31 -07:00
H
771bd756b3 [misc] docs: move dev folder to scripts. add sandbox documentation to index.rst. (#1539)
### Checklist Before Starting

- [x] Search for similar PR(s).

### What does this PR do?

- move dev folder to scripts @ETOgaosion 
- add sandbox documentation to index.rst @chenhaiq  
- installation docs have been updated

### Additional Info.

- **Issue Number**: Fixes issue # or discussion # if any.
- **Training**: [Note which backend this PR will affect: FSDP, Megatron,
both, or none]
- **Inference**: [Note which backend this PR will affect: vLLM, SGLang,
both, or none]

### Checklist Before Submitting

- [x] Read the [Contribute
Guide](https://github.com/volcengine/verl?tab=readme-ov-file#contribution-guide).
- [ ] Apply [pre-commit
checks](https://github.com/volcengine/verl?tab=readme-ov-file#code-linting-and-formatting).
- [ ] Add `[BREAKING]` to the PR title if it breaks any API.
- [x] Update the documentation about your changes in the
[docs](https://github.com/volcengine/verl/tree/main/docs).
- [ ] Add CI test(s) if neccessary.
2025-05-16 08:12:31 +08:00