no code implementations • 26 May 2024 • Shangding Gu, Bilgehan Sel, Yuhao Ding, Lu Wang, Qingwei Lin, Alois Knoll, Ming Jin
In numerous reinforcement learning (RL) problems involving safety-critical systems, a key challenge lies in balancing multiple objectives while simultaneously meeting all stringent safety constraints.
no code implementations • 26 May 2024 • Vanshaj Khattar, Yuhao Ding, Bilgehan Sel, Javad Lavaei, Ming Jin
Meta-reinforcement learning has been widely used as a learning-to-learn framework for solving unseen tasks with limited experience.
no code implementations • 21 May 2024 • Bilgehan Sel, Priya Shanmugasundaram, Mohammad Kachuee, Kun Zhou, Ruoxi Jia, Ming Jin
Large Language Models (LLMs) have shown remarkable capabilities in tasks such as summarization, arithmetic reasoning, and question answering.
2 code implementations • 2 May 2024 • Shangding Gu, Bilgehan Sel, Yuhao Ding, Lu Wang, Qingwei Lin, Ming Jin, Alois Knoll
Ensuring the safety of Reinforcement Learning (RL) is crucial for its deployment in real-world applications.
no code implementations • 20 Aug 2023 • Ming Jin, Bilgehan Sel, Fnu Hardeep, Wotao Yin
This paper outlines a natural conversational approach to solving personalized energy-related problems using large language models (LLMs).
no code implementations • 20 Aug 2023 • Bilgehan Sel, Ahmad Al-Tawaha, Vanshaj Khattar, Ruoxi Jia, Ming Jin
Current literature, aiming to surpass the "Chain-of-Thought" approach, often resorts to an external modus operandi of halting, modifying, and then resuming the generation process to boost the reasoning capacities of Large Language Models (LLMs).
no code implementations • 2 Dec 2022 • Ming Jin, Vanshaj Khattar, Harshal Kaushik, Bilgehan Sel, Ruoxi Jia
We study the expressibility and learnability of convex optimization solution functions and their multi-layer architectural extension.