Skip to content

Add curriculum description, both reasoning and execution manual.#53

Merged
rbx merged 1 commit into
FairRootGroup:masterfrom
enlorenz:curr
Jun 10, 2026
Merged

Add curriculum description, both reasoning and execution manual.#53
rbx merged 1 commit into
FairRootGroup:masterfrom
enlorenz:curr

Conversation

@enlorenz

Copy link
Copy Markdown
Collaborator

No description provided.

@coderabbitai

coderabbitai Bot commented Jun 10, 2026

Copy link
Copy Markdown
Contributor

Review Change Stack

Warning

Review limit reached

@enlorenz, we couldn't start this review because you've reached your PR review rate limit.

More reviews will be available in 45 minutes and 1 second. Learn how PR review limits work.

Your organization has run out of usage credits. Purchase more in the billing tab.

⌛ How to resolve this issue?

After more reviews become available, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans include higher PR review limits than trial, open-source, and free plans. In all cases, reviews become available again over time. During sustained high-volume PR review activity, CodeRabbit may temporarily slow when the next review becomes available.

Please see our Fair Usage Limits Policy for further information.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 60cc7fd2-2e46-4ab3-a3af-647487385030

📥 Commits

Reviewing files that changed from the base of the PR and between 882531b and 53c93c9.

📒 Files selected for processing (3)
  • README.md
  • curriculum_argument.md
  • curriculum_setup.md
📝 Walkthrough

Walkthrough

This PR introduces formal curriculum documentation for a scheduling RL task. A new curriculum rationale document defines the target deferral policy, learning challenges, and six-stage progression strategy. A companion operator runbook provides practical training instructions with stage-specific configurations. README links are updated to point to both documents.

Changes

Curriculum Documentation

Layer / File(s) Summary
Curriculum design rationale
curriculum_argument.md
New document formalizes the deferral-oriented curriculum strategy: defines target policy and control law, explains why the scheduling task has long-horizon credit assignment and local optima challenges, describes how reward design removes contradictions and creates structural pressure, justifies the six-stage progression (A–F), and lists measurable diagnostics for curriculum success.
Curriculum execution guide and documentation links
curriculum_setup.md, README.md
New runbook provides shared train–checkpoint–evaluate–prune workflow structure, command templates, and stage-specific argument configurations (A–F) with workload and pricing placeholders. README updated to link both curriculum documents from the project root instead of analysis/ subdirectory.

🎯 2 (Simple) | ⏱️ ~10 minutes

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 inconclusive)

Check name Status Explanation Resolution
Description check ❓ Inconclusive No pull request description was provided by the author, making it impossible to assess relevance. Add a brief description explaining the purpose of the curriculum documentation and how it relates to the project objectives.
✅ Passed checks (4 passed)
Check name Status Explanation
Title check ✅ Passed The title accurately summarizes the main changes: adding curriculum documentation with both reasoning and execution instructions.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Linked Issues check ✅ Passed Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check ✅ Passed Check skipped because no linked issues were found for this pull request.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.


Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@coderabbitai coderabbitai Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@curriculum_setup.md`:
- Around line 228-239: Resolve the inconsistency between curriculum_setup.md and
curriculum_argument.md for "Stage E: Main Arrivals + Noisy Logic Prices" by
choosing one of the three fixes and applying it consistently: (A) mark Stage E
as optional/experimental in both documents by adding an explicit
"Optional/Experimental" label and a short justification and keep the placeholder
in curriculum_setup.md if you intend it optional; (B) if you intend Stage E as
required, remove the "usually skipped" remark and fill in the missing
stage-specific arguments in curriculum_setup.md (replace "[fill in
noisy-logic-price setup]" with the full noisy-logic-price configuration and
example commands) and update curriculum_argument.md to assert it is part of the
formal progression; or (C) retain it as optional but complete the specification
and add a clear "experimental/optional" notice in both files; ensure the
phrasing for "Stage E" is identical in curriculum_setup.md and
curriculum_argument.md so operators see the same status and provide the
noisy-logic-price setup snippet under the Stage E heading if you choose to keep
the stage runnable.
- Line 206: The sentence "Note: I trained on `2.0`, but staged scaling such as
`1.0 -> 2.0` is also possible." uses first-person voice; change it to an
operator-manual tone by rephrasing to something like "Trained on `2.0`; staged
scaling such as `1.0 -> 2.0` is also supported." Replace the original string
with this neutral phrasing in the curriculum_setup.md line where that note
appears to keep tone consistent with the rest of the document.
🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

  • Push a commit to this branch (recommended)
  • Create a new PR with the fixes

ℹ️ Review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 77d43f95-97b4-4024-91ae-caf4602aeac5

📥 Commits

Reviewing files that changed from the base of the PR and between 8a71cf9 and 882531b.

📒 Files selected for processing (3)
  • README.md
  • curriculum_argument.md
  • curriculum_setup.md

Comment thread curriculum_setup.md
# --job-arrival-scale [SCALE]
```

Note: I trained on `2.0`, but staged scaling such as `1.0 -> 2.0` is also possible.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Replace first-person note with operator-manual voice.

The informal first-person phrasing "I trained on 2.0" is inconsistent with the rest of the document's operator-manual tone.

📝 Suggested rewording
-Note: I trained on `2.0`, but staged scaling such as `1.0 -> 2.0` is also possible.
+Note: Example scale values include `2.0` or staged scaling such as `1.0 -> 2.0`.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@curriculum_setup.md` at line 206, The sentence "Note: I trained on `2.0`, but
staged scaling such as `1.0 -> 2.0` is also possible." uses first-person voice;
change it to an operator-manual tone by rephrasing to something like "Trained on
`2.0`; staged scaling such as `1.0 -> 2.0` is also supported." Replace the
original string with this neutral phrasing in the curriculum_setup.md line where
that note appears to keep tone consistent with the rest of the document.

Comment thread curriculum_setup.md
Comment on lines +228 to +239
## Stage E: Main Arrivals + Noisy Logic Prices (Optional)

Goal: keep the learned policy while adding moderate price irregularity.

Comment: This stage is usually skipped. Another run with higher job scale is often used instead.
The idea remains valid, but at the moment it does not change much in practice.

### Stage-specific arguments

```bash
# [fill in noisy-logic-price setup]
```

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟠 Major | ⚖️ Poor tradeoff

Resolve Stage E curriculum inconsistency between documents.

curriculum_argument.md presents Stage E ("Main Arrivals + Noisy Logic Prices") as part of the formal intended 6-stage progression (lines 118-120: "This stage adds robustness..."), but curriculum_setup.md states it is "usually skipped" (line 232) and "does not change much in practice" (line 233).

This creates ambiguity for operators: the rationale document presents E as a necessary robustness step, while the execution guide recommends skipping it.

Additionally, the stage-specific arguments remain incomplete (line 238: "[fill in noisy-logic-price setup]"), which compounds the confusion.

Recommendation: Either (1) update curriculum_argument.md to acknowledge Stage E as optional with limited practical value, or (2) complete the Stage E specification here and remove the "usually skipped" disclaimer, or (3) clearly mark Stage E as "experimental/optional" in both documents with consistent language.

🧰 Tools
🪛 LanguageTool

[style] ~233-~233: For conciseness, consider replacing this expression with an adverb.
Context: ...ed instead. The idea remains valid, but at the moment it does not change much in practice. #...

(AT_THE_MOMENT)

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@curriculum_setup.md` around lines 228 - 239, Resolve the inconsistency
between curriculum_setup.md and curriculum_argument.md for "Stage E: Main
Arrivals + Noisy Logic Prices" by choosing one of the three fixes and applying
it consistently: (A) mark Stage E as optional/experimental in both documents by
adding an explicit "Optional/Experimental" label and a short justification and
keep the placeholder in curriculum_setup.md if you intend it optional; (B) if
you intend Stage E as required, remove the "usually skipped" remark and fill in
the missing stage-specific arguments in curriculum_setup.md (replace "[fill in
noisy-logic-price setup]" with the full noisy-logic-price configuration and
example commands) and update curriculum_argument.md to assert it is part of the
formal progression; or (C) retain it as optional but complete the specification
and add a clear "experimental/optional" notice in both files; ensure the
phrasing for "Stage E" is identical in curriculum_setup.md and
curriculum_argument.md so operators see the same status and provide the
noisy-logic-price setup snippet under the Stage E heading if you choose to keep
the stage runnable.

@rbx rbx merged commit 2f0de9e into FairRootGroup:master Jun 10, 2026
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants