Certified papers at TMLR
@TmlrCert
Followers
481
Following
0
Media
0
Statuses
403
New #J2CCertification: Learning Deformable Body Interactions With Adaptive Spatial Tokenization Hao Wang, Yu Liu, Daniel Biggs, Haoru Wang, Jiandong Yu, Ping Huang https://t.co/2dEUMvxj0p
#mesh #meshes #modeling
openreview.net
Simulating interactions between deformable bodies is vital in fields like material science, mechanical design, and robotics. While learning-based methods with Graph Neural Networks (GNNs) are...
0
0
0
New #ExpertCertification: Overcoming Non-stationary Dynamics with Evidential Proximal Policy Optimization Abdullah Akgül, Gulcin Baykal, Manuel Haussmann, Melih Kandemir https://t.co/PkLeunTFJC
#critic #reinforcement #exploration
openreview.net
Continuous control of non-stationary environments is a major challenge for deep reinforcement learning algorithms. The time-dependency of the state transition dynamics aggravates the notorious...
0
0
0
New #J2CCertification: Testing with Non-identically Distributed Samples Shivam Garg, Chirag Pabbaraju, Kirankumar Shiragur, Gregory Valiant https://t.co/Pzmdg1hCPe
#distributional #distributions #distribution
openreview.net
We examine the extent to which sublinear-sample property testing and estimation applies to settings where samples are independently but not identically distributed. Specifically, we consider the...
0
0
0
New #SurveyCertification: Neural Spatiotemporal Point Processes: Trends and Challenges Sumantrak Mukherjee, Mouad Elhamdi, George Mohler et al. https://t.co/Av1XiTkgah
#spatiotemporal #events #event
openreview.net
Spatiotemporal point processes (STPPs) are probabilistic models for events occurring in continuous space and time. Real-world event data often exhibits intricate dependencies and heterogeneous...
0
0
0
New #J2CCertification: DNOD: Deformable Neural Operators for Object Detection in SAR Images GVS Mothish, J Rishi, Shobhit Kumar Shukla, Deepak Subramani https://t.co/sInQYC6krl
#sar #radar #encoder
openreview.net
We introduce a deep neural operator framework aimed at object detection in remotely sensed Synthetic Aperture Radar (SAR) images. Recent research highlights the impressive performance of the...
0
0
0
New #FeaturedCertification,J2CCertification: PCF Learned Sort: a Learning Augmented Sort Algorithm with $\mathcal{O}(n \log\log n)$ Expected C... Atsuki Sato, Yusuke Matsui https://t.co/PeOh9lccmM
#sorting #sort #sorts
openreview.net
Sorting is one of the most fundamental algorithms in computer science. Recently, Learned Sorts, which use machine learning to improve sorting speed, have attracted attention. While existing studies...
0
0
0
New #J2CCertification: TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models Yao Xiao, Qiqian Fu, Heyi Tao, Yuqun Wu, Zhen Zhu, Derek Hoiem https://t.co/g2MErXwX4c
#text #textregion #visual
openreview.net
Image-text models excel at image-level tasks but struggle with detailed visual understanding. While these models provide strong visual-language alignment, segmentation models like SAM2 offer...
0
0
1
New #J2CCertification: Encoder-only Next Token Prediction Ethan Ewer, Daewon Chae, Thomas Zeng, Jinkyu Kim, Kangwook Lee https://t.co/mnFdSwKmPI
#encoder #decoder #attention
openreview.net
Next-token prediction is conventionally done using decoder-only Transformers with causal attention, as this approach allows for efficient reuse of keys and values. What if we were not...
0
1
1
New #J2CCertification: Efficient and Unbiased Sampling from Boltzmann Distributions via Variance-Tuned Diffusion Models Fengzhe Zhang, Laurence Illing Midgley, José Miguel Hernández-Lobato https://t.co/IyiVe5u83y
#diffusion #boltzmann #sampling
openreview.net
Score-based diffusion models (SBDMs) are powerful amortized samplers for Boltzmann distributions; however, imperfect score estimates bias downstream Monte Carlo estimates. Classical importance...
0
0
0
New #ReproducibilityCertification: Kernel Space Conditional Distribution Alignment for Improving Group Fairness in Deepfake Detection Sayantan Das, Mojtaba Kolahdouzi, Ali Etemad https://t.co/U0biKQsuKs
#deepfake #fairness #faceforensics
openreview.net
We introduce FairAlign, a new method to reduce bias and improve group fairness in deepfake detection by aligning conditional distributions of embeddings in a high-dimensional kernel space. Our...
0
0
0
New #J2CCertification: Expert Routing with Synthetic Data for Domain Incremental Learning Yewon Byun, Sanket Vaibhav Mehta, Saurabh Garg et al. https://t.co/bUaRpZQJ9E
#lifelong #forgetting #ensemble
openreview.net
In many real-world settings, regulations and economic incentives permit the sharing of models but not data across institutional boundaries. In such scenarios, practitioners might hope to adapt...
0
0
0
New #J2CCertification: Models of human preference for learning reward functions W. Bradley Knox, Stephane Hatgis-Kessell, Serena Booth et al. https://t.co/nNNFxsCBN3
#reward #reinforcement #regret
openreview.net
The utility of reinforcement learning is limited by the alignment of reward functions with the interests of human stakeholders. One promising method for alignment is to learn the reward function...
0
0
0
New #J2CCertification: Improved Regret Bounds for Linear Adversarial MDPs via Linear Optimization Fang Kong, XiangCheng Zhang, Baoxiang Wang, Shuai Li https://t.co/275VBnYbt7
#adversarial #bandit #optimization
openreview.net
Learning Markov decision processes (MDP) in an adversarial environment has been a challenging problem. The problem becomes even more challenging with function approximation since the underlying...
0
0
0
New #J2CCertification: RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation Konstantinos Bousmalis, Giulia Vezzani, Dushyant Rao et al. https://t.co/lCeDjxPQTx
#robotic #robot #robots
openreview.net
The ability to leverage heterogeneous robotic experience from different robots and tasks to quickly master novel skills and embodiments has the potential to transform robot learning. Inspired by...
0
0
0
New #J2CCertification: MMD-Regularized Unbalanced Optimal Transport Piyushi Manupriya, SakethaNath Jagarlapudi, Pratik Jawanpuria https://t.co/Pse2l5RnPd
#regularization #transport #regularized
openreview.net
We study the unbalanced optimal transport (UOT) problem, where the marginal constraints are enforced using Maximum Mean Discrepancy (MMD) regularization. Our work is motivated by the observation...
0
0
1
New #J2CCertification: Exposing and Addressing Cross-Task Inconsistency in Unified Vision-Language Models Adyasha Maharana, Amita Kamath, Christopher Clark, Mohit Bansal, Aniruddha Kembhavi https://t.co/vjlPrJO8um
#benchmark #tasks #cococon
openreview.net
As general purpose vision models get increasingly effective at a wide set of tasks, it is imperative that they be consistent across the tasks they support. Inconsistent AI models are considered...
0
0
0
New #J2CCertification: MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments Spyros Gidaris, Andrei Bursuc, Oriane Siméoni et al. https://t.co/se6I87BWRZ
#supervised #vision #training
openreview.net
Self-supervised learning can be used for mitigating the greedy needs of Vision Transformer networks for very large fully-annotated datasets. Different classes of self-supervised learning offer...
0
0
0
New #J2CCertification: Optimization with Access to Auxiliary Information El Mahdi Chayti, Sai Praneeth Karimireddy https://t.co/eBmm4U5APz
#optimization #minimizing #gradients
openreview.net
We investigate the fundamental optimization question of minimizing a \emph{target} function $f(x)$, whose gradients are expensive to compute or have limited availability, given access to some...
0
0
0
New #J2CCertification: Voyager: An Open-Ended Embodied Agent with Large Language Models Guanzhi Wang, Yuqi Xie, Yunfan Jiang et al. https://t.co/yUxJRbIAgd
#learned #exploration #skills
openreview.net
We introduce Voyager, the first LLM-powered embodied lifelong learning agent in an open-ended world that continuously explores, acquires diverse skills, and makes novel discoveries without human...
0
0
0
New #J2CCertification: Variance-aware decision making with linear function approximation under heavy-tailed rewards Xiang Li, Qiang Sun https://t.co/cfvAq1tg3T
#bandits #rewards #reward
openreview.net
This paper studies how to achieve variance-aware regrets for online decision-making in the presence of heavy-tailed rewards with only finite variances. For linear stochastic bandits, we address...
0
0
0