Agentic Reinforced and Operational Workflow

Ayush Maurya; Deependra Bahadur Maurya

doi:https://doi.org/10.64388/IREV9I2-1710430-859

Agentic Reinforced and Operational Workflow

Author(s): Ayush Maurya; Deependra Bahadur Maurya
Paper ID: 1710430
Page: 221-229
Published Date: 05-09-2025
Published In: Iconic Research And Engineering Journals
Publisher: IRE Journals
e-ISSN: 2456-8880
Volume/Issue: Volume 9 Issue 3 September-2025

Download

Abstract

We present Agentic Reinforced and Operational Workflow (AROW), a novel multi-agent system that integrates large pretrained language models (PLMs) with cooperative multi-agent reinforcement learning (MARL) to perform complex tasks with improved coordination and factual reliability. The system features a Neural Execution Planner (NEP) that parses a user query into subgoals, a decentralized Reinforcement Distributor to allocate credit, and a PLM-based supervisor that assigns subtasks to specialized agents via JSON-formatted instructions. Agents communicate through a shared-memory “blackboard” for intermediate results. During execution, each agent’s output is validated (e.g. by verifier agents and RAG grounding) and assigned a quality score $x_i\in{0,1}$ for reinforcement. Learning employs cooperative MARL techniques: we use QMIX’s monotonic value-mixing network to learn a global action-value and COMA’s counterfactual baseline for credit assignment[1][2]. For hallucination mitigation, outputs are constrained by strict JSON schema (enforced via prompt priming[3]), cross-checked against retrieved documents (RAG), and subject to provenance tracking and reward penalties for unverifiable claims[4][5]. We evaluate AROW on two fronts: (1) synthetic cooperative simulations (e.g. multi-robot resource-gathering tasks[6]) to measure coordination and credit learning, and (2) document-grounded QA challenges to test fact-consistency. Example JSON instructions, agent responses, and verifier behavior are provided. Results (theoretical) indicate enhanced task performance and reduced hallucinations compared to baselines. The paper emphasizes the practical integration of agentic architectures with MARL to achieve scalable, reliable autonomous workflows.

Citations

IRE Journals:
Ayush Maurya, Deependra Bahadur Maurya "Agentic Reinforced and Operational Workflow" Iconic Research And Engineering Journals Volume 9 Issue 3 2025 Page 221-229 https://doi.org/10.64388/IREV9I2-1710430-859

IEEE:
Ayush Maurya, Deependra Bahadur Maurya "Agentic Reinforced and Operational Workflow" Iconic Research And Engineering Journals, 9(3) https://doi.org/10.64388/IREV9I2-1710430-859

Agentic Reinforced and Operational Workflow

Abstract

Citations

About IRE Journals

Important Links

For Authors

Contact Us For Help

Connect With Us