hackquest logo

0g-promptRL

PromptRL is a reinforcement learning project for learning cost-aware LLM configurations. It treats prompt strategy selection as a Q-learning problem and searches for the best combination of model.

視頻

專案圖片 1
專案圖片 2
專案圖片 3

技術堆疊

React
Next
Node

描述

PromptRL is a reinforcement learning project for learning cost-aware LLM configurations. It treats prompt strategy selection as a Q-learning problem and searches for the best combination of model, reasoning mode, and persona for different task difficulty levels. Instead of sending every task through the same model and prompt style, PromptRL explores a discrete action space and optimizes for output quality relative to inference cost.

Tasks Included

  • easy: short tweet generation

  • medium: constrained LinkedIn post generation

  • hard: beginner-friendly explanation of quantum computing

團隊負責人
VVidip Ghosh
專案連結
行業
AIInfra