LFED: A Literary Fiction Evaluation Dataset for Large Language Models

tjunlp-lab/lfed 16 May 2024

The rapid evolution of large language models (LLMs) has ushered in the need for comprehensive assessments of their performance across various dimensions.

0
16 May 2024

Evaluating Algorithmic Bias in Models for Predicting Academic Performance of Filipino Students

pcla-code/2024-edm-bias 16 May 2024

The best-performing model reached AUC of 0. 75 and weighted F1-score of 0. 79.

0
16 May 2024

Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection

ferry-li/si-sod 16 May 2024

This paper explores the size-invariance of evaluation metrics in Salient Object Detection (SOD), especially when multiple targets of diverse sizes co-exist in the same image.

3
16 May 2024

MarkLLM: An Open-Source Toolkit for LLM Watermarking

thu-bpm/markllm 16 May 2024

However, the abundance of LLM watermarking algorithms, their intricate mechanisms, and the complex evaluation procedures and perspectives pose challenges for researchers and the community to easily experiment with, understand, and assess the latest advancements.

28
16 May 2024

iDRAMA-Scored-2024: A Dataset of the Scored Social Media Platform from 2020 to 2023

idramalab/iDRAMA-scored-2024 16 May 2024

Online web communities often face bans for violating platform policies, encouraging their migration to alternative platforms.

0
16 May 2024

Many-Shot In-Context Learning in Multimodal Foundation Models

stanfordmlgroup/ManyICL 16 May 2024

We show that batching up to 50 queries can lead to performance improvements under zero-shot and many-shot ICL, with substantial gains in the zero-shot setting on multiple datasets, while drastically reducing per-query cost and latency.

3
16 May 2024

PIR: Remote Sensing Image-Text Retrieval with Prior Instruction Representation Learning

jaychempan/pir-clip 16 May 2024

Continuing with the above, we propose PIR-CLIP, a domain-specific CLIP-based framework for remote sensing image-text retrieval, to address semantic noise in remote sensing vision-language representations and further improve open-domain retrieval performance.

2
16 May 2024

DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy Protection

hanssuny/diffam 16 May 2024

In this paper, we propose a novel face protection approach, dubbed DiffAM, which leverages the powerful generative ability of diffusion models to generate high-quality protected face images with adversarial makeup transferred from reference images.

5
16 May 2024

4D Panoptic Scene Graph Generation

Jingkang50/OpenPSG 16 May 2024

To facilitate research in this new area, we build a richly annotated PSG-4D dataset consisting of 3K RGB-D videos with a total of 1M frames, each of which is labeled with 4D panoptic segmentation masks as well as fine-grained, dynamic scene graphs.

395
16 May 2024

Manifold Integrated Gradients: Riemannian Geometry for Feature Attribution

eszaher/manifold-integrated-gradients 16 May 2024

In this paper, we dive into the reliability concerns of Integrated Gradients (IG), a prevalent feature attribution method for black-box deep learning models.

0
16 May 2024