no code implementations • 6 Jun 2024 • Yihe Dong, Sercan Arik, Nathanael Yoder, Tomas Pfister
Feature engineering has demonstrated substantial utility for many machine learning workflows, such as in the small data regime or when distribution shifts are severe.
1 code implementation • 1 Nov 2023 • Chuizheng Meng, Yihe Dong, Sercan Ö. Arik, Yan Liu, Tomas Pfister
Estimation of temporal counterfactual outcomes from observed history is crucial for decision-making in many domains such as healthcare and e-commerce, particularly when randomized controlled trials (RCTs) suffer from high cost or impracticality.
1 code implementation • 26 May 2023 • Sayna Ebrahimi, Sercan O. Arik, Yihe Dong, Tomas Pfister
To bridge this gap, we propose LANISTR, an attention-based framework to learn from LANguage, Image, and STRuctured data.
no code implementations • 6 Apr 2023 • Yihe Dong, Sercan O. Arik
Feature selection has been widely used to reduce compute requirements during training, improve model interpretability, and enhance model generalizability.
1 code implementation • 7 Oct 2022 • Rui Wang, Yihe Dong, Sercan Ö. Arik, Rose Yu
Temporal distributional shifts, with underlying dynamics changing over time, frequently occur in real-world time series and pose a fundamental challenge for deep neural networks (DNNs).
1 code implementation • 5 Mar 2021 • Yihe Dong, Jean-Baptiste Cordonnier, Andreas Loukas
Attention-based architectures have become ubiquitous in machine learning, yet our understanding of the reasons for their effectiveness remains limited.
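The architectures the paper analyzes are built on scaled dot-product attention. As a point of reference, here is a minimal single-query sketch of that operation in plain Python (an illustration of the standard mechanism, not the paper's analysis code):

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def attention(query, keys, values):
    """Single-query scaled dot-product attention.

    query: list of d floats; keys/values: lists of n length-d vectors.
    Scores are query-key dot products scaled by sqrt(d); the output is the
    softmax-weighted average of the value vectors.
    """
    d = len(query)
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
              for key in keys]
    weights = softmax(scores)
    return [sum(w * v[i] for w, v in zip(weights, values))
            for i in range(len(values[0]))]
```

With two identical keys, the weights are uniform and the output is the plain average of the values.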
no code implementations • NeurIPS 2020 • Yihe Dong, Will Sawin
We introduce COPT, a novel distance metric between graphs defined via an optimization routine, computing a coordinated pair of optimal transport maps simultaneously.
1 code implementation • 22 Jun 2020 • Yihe Dong, Will Sawin, Yoshua Bengio
Hypergraphs provide a natural representation for many real world datasets.
3 code implementations • NeurIPS 2020 • Sourav Biswas, Yihe Dong, Gautam Kamath, Jonathan Ullman
We present simple differentially private estimators for the mean and covariance of multivariate sub-Gaussian data that are accurate at small sample sizes.
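For context, the textbook baseline for private mean estimation is the Gaussian mechanism applied to clipped samples. The sketch below shows that baseline in one dimension; it is not the paper's estimator, which improves on this approach at small sample sizes:

```python
import math
import random

def dp_mean(samples, bound, epsilon, delta):
    """(epsilon, delta)-DP mean of 1-D samples via the Gaussian mechanism.

    Clipping each sample to [-bound, bound] caps any one record's influence,
    so the empirical mean has sensitivity 2 * bound / n; Gaussian noise
    calibrated to that sensitivity yields the privacy guarantee.
    """
    n = len(samples)
    clipped = [max(-bound, min(bound, x)) for x in samples]
    sensitivity = 2.0 * bound / n
    sigma = sensitivity * math.sqrt(2.0 * math.log(1.25 / delta)) / epsilon
    return sum(clipped) / n + random.gauss(0.0, sigma)
```

The weakness at small n is visible in the formula: the noise scale grows with the a priori bound, which the paper's iterative estimators avoid paying in full.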
1 code implementation • 3 May 2020 • Yihe Dong, Yu Gao, Richard Peng, Ilya Razenshteyn, Saurabh Sawlani
We investigate the problem of efficiently computing optimal transport (OT) distances, which is equivalent to the node-capacitated minimum cost maximum flow problem in a bipartite graph.
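The general problem requires a flow solver, but the one-dimensional special case has a closed form that illustrates the OT objective: with uniform weights and |x - y| cost, the optimal plan matches points in sorted order. A short sketch (an illustrative special case, not the paper's algorithm):

```python
def ot_cost_1d(a, b):
    """Optimal transport cost between two equal-size 1-D point sets
    with uniform weights and absolute-difference ground cost.

    In one dimension the optimal coupling is monotone, so sorting both
    sets and matching pointwise is exact -- no flow solver needed.
    """
    assert len(a) == len(b)
    return sum(abs(x - y) for x, y in zip(sorted(a), sorted(b))) / len(a)
```

In higher dimensions no such ordering exists, which is why OT reduces to the min-cost flow formulation the paper studies.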
1 code implementation • ICML 2020 • Arturs Backurs, Yihe Dong, Piotr Indyk, Ilya Razenshteyn, Tal Wagner
Our extensive experiments on real-world text and image datasets show that Flowtree improves over various baselines and existing methods in either running time or accuracy.
1 code implementation • NeurIPS 2019 • Yihe Dong, Samuel B. Hopkins, Jerry Li
In robust mean estimation the goal is to estimate the mean $\mu$ of a distribution on $\mathbb{R}^d$ given $n$ independent samples, an $\varepsilon$-fraction of which have been corrupted by a malicious adversary.
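To see why the problem is nontrivial: a single adversarial sample can move the empirical mean arbitrarily far. A classical one-dimensional defense is the trimmed mean, sketched below as a baseline (the paper's estimator, based on quantum entropy scoring, handles the much harder high-dimensional case):

```python
def trimmed_mean(samples, eps):
    """Trimmed mean: drop the eps-fraction smallest and largest values,
    then average the rest. Robust to an eps-fraction of outliers in 1-D,
    but naive coordinate-wise trimming loses dimension-dependent accuracy
    in high dimensions -- the regime the paper targets.
    """
    n = len(samples)
    k = int(eps * n)
    s = sorted(samples)
    kept = s[k:n - k] if k > 0 else s
    return sum(kept) / len(kept)
```

On 90 clean zeros plus 10 outliers at 1000, the naive mean is 100 while the 10%-trimmed mean recovers 0.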
no code implementations • 3 Apr 2019 • Hao Chen, Ilaria Chillotti, Yihe Dong, Oxana Poburinnaya, Ilya Razenshteyn, M. Sadegh Riazi
In this paper, we introduce SANNS, a system for secure $k$-NNS that keeps the client's query and the search result confidential.
1 code implementation • ICLR 2020 • Yihe Dong, Piotr Indyk, Ilya Razenshteyn, Tal Wagner
Space partitions of $\mathbb{R}^d$ underlie a vast and important class of fast nearest neighbor search (NNS) algorithms.
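A classical data-independent instance of such a partition is random hyperplane hashing: each hyperplane through the origin splits space in two, and points are bucketed by the pattern of sides they fall on. A minimal sketch for contrast (the paper's contribution is to *learn* the partition from data rather than draw it at random):

```python
def hyperplane_hash(point, normals):
    """Bucket a point by which side of each hyperplane (through the
    origin, given by its normal vector) it lies on. Nearby points tend
    to land in the same cell, enabling fast candidate retrieval for NNS.
    """
    bits = []
    for normal in normals:
        dot = sum(p * c for p, c in zip(point, normal))
        bits.append(1 if dot >= 0 else 0)
    return tuple(bits)
```

With the two axis-aligned normals in 2-D, the four cells are exactly the four quadrants.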