Search Results for author: Bilgehan Sel

Found 7 papers, 1 papers with code

Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning

no code implementations26 May 2024 Shangding Gu, Bilgehan Sel, Yuhao Ding, Lu Wang, QIngwei Lin, Alois Knoll, Ming Jin

In numerous reinforcement learning (RL) problems involving safety-critical systems, a key challenge lies in balancing multiple objectives while simultaneously meeting all stringent safety constraints.

A CMDP-within-online framework for Meta-Safe Reinforcement Learning

no code implementations26 May 2024 Vanshaj Khattar, Yuhao Ding, Bilgehan Sel, Javad Lavaei, Ming Jin

Meta-reinforcement learning has widely been used as a learning-to-learn framework to solve unseen tasks with limited experience.

Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs

no code implementations21 May 2024 Bilgehan Sel, Priya Shanmugasundaram, Mohammad Kachuee, Kun Zhou, Ruoxi Jia, Ming Jin

Large Language Models (LLMs) have shown remarkable capabilities in tasks such as summarization, arithmetic reasoning, and question answering.

Arithmetic Reasoning Decision Making +1

A Human-on-the-Loop Optimization Autoformalism Approach for Sustainability

no code implementations20 Aug 2023 Ming Jin, Bilgehan Sel, Fnu Hardeep, Wotao Yin

This paper outlines a natural conversational approach to solving personalized energy-related problems using large language models (LLMs).

Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models

no code implementations20 Aug 2023 Bilgehan Sel, Ahmad Al-Tawaha, Vanshaj Khattar, Ruoxi Jia, Ming Jin

Current literature, aiming to surpass the "Chain-of-Thought" approach, often resorts to an external modus operandi involving halting, modifying, and then resuming the generation process to boost Large Language Models' (LLMs) reasoning capacities.

In-Context Learning

On Solution Functions of Optimization: Universal Approximation and Covering Number Bounds

no code implementations2 Dec 2022 Ming Jin, Vanshaj Khattar, Harshal Kaushik, Bilgehan Sel, Ruoxi Jia

We study the expressibility and learnability of convex optimization solution functions and their multi-layer architectural extension.

Cannot find the paper you are looking for? You can Submit a new open access paper.