#p_akata

HTM+25

Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study

GCPR 2025

#p_akata

RKG+25

Road Obstacle Video Segmentation

GCPR 2025

#p_akata

BGA+25

SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions

ICCV 2025

#p_akata

KCA+25

Scalable Ranked Preference Optimization for Text-to-Image Generation

ICCV 2025

#p_akata

GAS+25a

Person-Centric Annotations of LAION-400M: Auditing Bias and Its Transfer to Models

Preprint (Oct. 2025)

#p_akata

KXA+25

Training-Free Uncertainty Guidance for Complex Visual Tasks With MLLMs

Preprint (Oct. 2025)

#p_akata

BPB+25

Stitch: Training-Free Position Control in Multimodal Diffusion Transformers

Preprint (Sep. 2025)

#p_akata

GSS+25

Reference-Free Rating of LLM Responses via Latent Information

Preprint (Sep. 2025)

#p_akata

EKD+25

Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models

Preprint (Aug. 2025)

#p_akata

TRB+25

WikiBigEdit: Understanding the Limits of Lifelong Knowledge Editing in LLMs

ICML 2025

#p_akata

SGB+25

Align-Then-Unlearn: Embedding Alignment for LLM Unlearning

MUGen @ICML 2025

#p_akata

BRM+25

From Alexnet to Transformers: Measuring the Non-Linearity of Deep Neural Networks With Affine Optimal Transport

CVPR 2025

#p_akata

DUR+25a

How to Merge Your Multimodal Models Over Time?

CVPR 2025

#p_akata

KXG+25

COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-Training

CVPR 2025

#p_akata

RAD+25

Context-Aware Multimodal Pretraining

CVPR 2025

#p_akata

XKG+25

FLAIR: VLM With Fine-Grained Language-Informed Image Representations

CVPR 2025

#p_akata

RBF+25

Time Series Representations for Classification Lie Hidden in Pretrained Vision Transformers

Preprint (Jun. 2025)

#p_akata

SRG+25

Subspace-Boosted Model Merging

Preprint (Jun. 2025)

#p_akata

KAS+25

LoFT: LoRA-Fused Training Dataset Generation With Few-Shot Guidance

Preprint (May 2025)

#p_akata

BGA25

Decoupling Angles and Strength in Low-Rank Adaptation

ICLR 2025

#p_akata

BMA25

Tailoring Mixup to Data for Calibration

ICLR 2025

#p_akata

GHA+25

Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs)

ICLR 2025

#p_akata

UER+25

Disentangled Representation Learning With the Gromov-Monge Gap

ICLR 2025

#p_akata #p_theis

DUR+25

How to Merge Multimodal Models Over Time?

MCDC @ICLR 2025

#p_akata

PKB+25

Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models

Preprint (Apr. 2025)

#p_akata

GAS+25

A Large Scale Analysis of Gender Biases in Text-to-Image Generative Models

Preprint (Mar. 2025)

#p_akata

WAS+25

Discovering Chunks in Neural Embeddings for Interpretability

Preprint (Feb. 2025)

#p_akata

BAR+25

How Should the Advancement of Large Language Models Affect the Practice of Science?

Proceedings of the National Academy of Sciences 122.5. Jan. 2025

#p_akata

EKR+24

ReNO: Enhancing One-Step Text-to-Image Models Through Reward-Based Noise Optimization

NeurIPS 2024

#p_akata

URD+24

A Practitioner's Guide to Real-World Continual Multimodal Pretraining

NeurIPS 2024

#p_akata

HOF+24

Opening the Black Box: A Systematic Review on Explainable Artificial Intelligence in Remote Sensing

IEEE Geoscience and Remote Sensing Magazine 12.4. Dec. 2024

#p_akata #p_zhu

BLK+24

Post-Hoc Probabilistic Vision-Language Models

Preprint (Dec. 2024)

#p_akata

YXG+24

Conformable Convolution for Topologically Aware Learning of Complex Anatomical Structures

Preprint (Dec. 2024)

#p_akata #p_navab

CMP+24

Geometry Fidelity for Spherical Images

ECCV 2024

#p_akata

HKG+24

EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval

ECCV 2024

#p_akata

KBA+24

DataDream: Few-Shot Guided Dataset Generation

ECCV 2024

#p_akata

TRH+24

Reflecting on the State of Rehearsal-Free Continual Learning With Pretrained Models

CoLLAs 2024

#p_akata

DPA+24

SemioLLM: Assessing Large Language Models for Semiological Analysis in Epilepsy Research

AI4Science @ICML 2024

#p_akata

BRA+24

ETHER: Efficient Finetuning of Large-Scale Models With Hyperplane Reflections

ICML 2024

#p_akata

UER+24

Disentangled Representation Learning Through Geometry Preservation With the Gromov-Monge Gap

SPIGM @ICML 2024

#p_akata #p_theis

YFG+23

SCOPE: Structural Continuity Preservation for Medical Image Segmentation

GRAIL @MICCAI 2023

#p_akata #p_navab

YGX+23

SCOPE: Structural Continuity Preservation for Retinal Vessel Segmentation

GRAIL @MICCAI 2023

#p_akata #p_navab
