Skip to content

DWCTOD/cv-arxiv-daily

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Updated on 2024.07.18

Video_Classification

Publish Date Title Authors PDF Code
2024-07-16 Motion-Oriented Compositional Neural Radiance Fields for Monocular Dynamic Human Modeling Jaehyeok Kim et.al. 2407.11962v1 null
2024-07-16 A Transformer-based Approach for Augmenting Software Engineering Chatbots Datasets Ahmad Abdellatif et.al. 2407.11955v1 null
2024-07-16 Gated Temporal Diffusion for Stochastic Long-Term Dense Anticipation Olga Zatsarynna et.al. 2407.11954v1 null
2024-07-16 Temporally Consistent Stereo Matching Jiaxi Zeng et.al. 2407.11950v1 link
2024-07-17 Hierarchical Separable Video Transformer for Snapshot Compressive Imaging Ping Wang et.al. 2407.11946v2 link
2024-07-16 Tackling Oversmoothing in GNN via Graph Sparsification: A Truss-based Approach Tanvir Hossain et.al. 2407.11928v1 null
2024-07-16 The Strength of Bisymmetric Modes in SDSS-IV/MaNGA Barred Galaxy Kinematics Brian DiGiorgio Zanger et.al. 2407.11908v1 null
2024-07-16 GraphFM: A Scalable Framework for Multi-Graph Pretraining Divyansha Lachi et.al. 2407.11907v1 null
2024-07-16 SegSTRONG-C: Segmenting Surgical Tools Robustly On Non-adversarial Generated Corruptions -- An EndoVis'24 Challenge Hao Ding et.al. 2407.11906v1 null
2024-07-16 Automated production of batched unclonable micro-patterns anti-counterfeiting labels with strong robustness and rapid recognition speed Yuzheng He et.al. 2407.11886v1 null
2024-07-15 No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations Walter Simoncini et.al. 2407.10964v1 link
2024-07-15 InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models Nirat Saini et.al. 2407.10958v1 null
2024-07-15 MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models Chengguang Gan et.al. 2407.10953v1 null
2024-07-15 IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation Yuanhao Zhai et.al. 2407.10937v1 link
2024-07-15 Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together Dilara Soylu et.al. 2407.10930v1 null
2024-07-15 In-Loop Filtering via Trained Look-Up Tables Zhuoyuan Li et.al. 2407.10926v1 null
2024-07-15 A Dual-Attention Aware Deep Convolutional Neural Network for Early Alzheimer's Detection Pandiyaraju V et.al. 2407.10921v1 null
2024-07-16 DataDream: Few-shot Guided Dataset Generation Jae Myung Kim et.al. 2407.10910v2 link
2024-07-15 Interpreting Hand gestures using Object Detection and Digits Classification Sangeetha K et.al. 2407.10902v1 null
2024-07-15 Leveraging Multimodal CycleGAN for the Generation of Anatomically Accurate Synthetic CT Scans from MRIs Leonardo Crespi et.al. 2407.10888v1 null
2024-07-12 Non-Hermitian Origin of Wannier Localizability and Detachable Topological Boundary States Daichi Nakamura et.al. 2407.09458v1 null
2024-07-12 Let Me DeCode You: Decoder Conditioning with Tabular Data Tomasz Szczepański et.al. 2407.09437v1 link
2024-07-12 Rethinking temporal self-similarity for repetitive action counting Yanan Luo et.al. 2407.09431v1 null
2024-07-12 TelecomGPT: A Framework to Build Telecom-Specfic Large Language Models Hang Zou et.al. 2407.09424v1 null
2024-07-12 A grid of self-consistent MSG (MARCS-StaticWeather-GGchem) cool stellar, sub-stellar, and exoplanetary model atmospheres Uffe G. Jørgensen et.al. 2407.09397v1 null
2024-07-12 Open-Canopy: A Country-Scale Benchmark for Canopy Height Estimation at Very High Resolution Fajwel Fogel et.al. 2407.09392v1 link
2024-07-12 Radiance Fields from Photons Sacha Jungerman et.al. 2407.09386v1 null
2024-07-12 Reshaping the Online Data Buffering and Organizing Mechanism for Continual Test-Time Adaptation Zhilin Zhu et.al. 2407.09367v1 link
2024-07-12 Novel clustered federated learning based on local loss Endong Gu et.al. 2407.09360v1 link
2024-07-12 Imaging Interiors: An Implicit Solution to Electromagnetic Inverse Scattering Problems Ziyuan Luo et.al. 2407.09352v1 null
2024-07-11 Video Diffusion Alignment via Reward Gradients Mihir Prabhudesai et.al. 2407.08737v1 link
2024-07-11 Real-Time Anomaly Detection and Reactive Planning with Large Language Models Rohan Sinha et.al. 2407.08735v1 null
2024-07-11 WhisperNetV2: SlowFast Siamese Network For Lip-Based Biometrics Abdollah Zakeri et.al. 2407.08717v1 null
2024-07-11 Sensor-Aware Classifiers for Energy-Efficient Time Series Applications on IoT Devices Dina Hussein et.al. 2407.08715v1 null
2024-07-11 Towards Efficient Deployment of Hybrid SNNs on Neuromorphic and Edge AI Hardware James Seekings et.al. 2407.08704v1 null
2024-07-11 Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models Zhening Xing et.al. 2407.08701v1 null
2024-07-11 ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions Jiu Feng et.al. 2407.08691v1 link
2024-07-11 Generalizable Implicit Motion Modeling for Video Frame Interpolation Zujin Guo et.al. 2407.08680v1 null
2024-07-11 Still-Moving: Customized Video Generation without Customized Video Data Hila Chefer et.al. 2407.08674v1 null
2024-07-11 NODE-Adapter: Neural Ordinary Differential Equations for Better Vision-Language Reasoning Yi Zhang et.al. 2407.08672v1 null
2024-07-10 LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models Feng Li et.al. 2407.07895v1 link
2024-07-10 Vegetable Peeling: A Case Study in Constrained Dexterous Manipulation Tao Chen et.al. 2407.07884v1 null
2024-07-10 Controlling Space and Time with Diffusion Models Daniel Watson et.al. 2407.07860v1 null
2024-07-11 Functional Assessment of Cerebral Capillaries using Single Capillary Reporters in Ultrasound Localization Microscopy Stephen A Lee et.al. 2407.07857v2 null
2024-07-10 Study on Aspect Ratio Variability toward Robustness of Vision Transformer-based Vehicle Re-identification Mei Qiu et.al. 2407.07842v1 null
2024-07-10 Benchmarking Embedding Aggregation Methods in Computational Pathology: A Clinical Data Perspective Shengjia Chen et.al. 2407.07841v1 link
2024-07-10 Probe and Prejudice: Classification of compact objects and model comparison using EOS knowledge Hauke Koehn et.al. 2407.07837v1 null
2024-07-10 RT-LA-VocE: Real-Time Low-SNR Audio-Visual Speech Enhancement Honglie Chen et.al. 2407.07825v1 null
2024-07-10 New Gravitational Wave Discoveries Enabled by Machine Learning Alexandra E. Koloniari et.al. 2407.07820v1 null
2024-07-10 The Misclassification Likelihood Matrix: Some Classes Are More Likely To Be Misclassified Than Others Daniel Sikar et.al. 2407.07818v1 null
2024-07-09 V-VIPE: Variational View Invariant Pose Embedding Mara Levy et.al. 2407.07092v1 null
2024-07-09 Fine-Tuning Linear Layers Only Is a Simple yet Effective Way for Task Arithmetic Ruochen Jin et.al. 2407.07089v1 link
2024-07-09 MoSt-DSA: Modeling Motion and Structural Interactions for Direct Multi-Frame Interpolation in DSA Images Ziyang Xu et.al. 2407.07078v1 link
2024-07-09 MADE-for-ASD: A Multi-Atlas Deep Ensemble Network for Diagnosing Autism Spectrum Disorder Md Rakibul Hasan et.al. 2407.07076v1 null
2024-07-10 CAPformer: Compression-Aware Pre-trained Transformer for Low-Light Image Enhancement Wei Wang et.al. 2407.07056v2 null
2024-07-09 Latent Space Imaging Matheus Souza et.al. 2407.07052v1 null
2024-07-09 Simple and Interpretable Probabilistic Classifiers for Knowledge Graphs Christian Riefolo et.al. 2407.07045v1 null
2024-07-09 Free Fermionic Constructions of Heterotic Strings Ioannis Florakis et.al. 2407.07034v1 null
2024-07-09 Resolving Sentiment Discrepancy for Multimodal Sentiment Detection via Semantics Completion and Decomposition Daiqing Wu et.al. 2407.07026v1 null
2024-07-09 Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization Jeongseok Hyun et.al. 2407.07024v1 link
2024-07-08 Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision Orr Zohar et.al. 2407.06189v1 link
2024-07-08 Classification of Cellular Automata based on the Hamming distance Gaspar Alfaro et.al. 2407.06175v1 null
2024-07-08 The Tug-of-War Between Deepfake Generation and Detection Hannah Lee et.al. 2407.06174v1 null
2024-07-08 PanDORA: Casual HDR Radiance Acquisition for Indoor Scenes Mohammad Reza Karimi Dastjerdi et.al. 2407.06150v1 null
2024-07-08 Physics-informed machine learning approaches to reactor antineutrino detection Sophia Farrell et.al. 2407.06139v1 null
2024-07-08 Depression Detection and Analysis using Large Language Models on Textual and Audio-Visual Modalities Avinash Anand et.al. 2407.06125v1 null
2024-07-08 Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation Xinyu Bai et.al. 2407.06095v1 null
2024-07-08 ERR@HRI 2024 Challenge: Multimodal Detection of Errors and Failures in Human-Robot Interactions Micol Spitale et.al. 2407.06094v1 null
2024-07-08 Artificial Intuition: Efficient Classification of Scientific Abstracts Harsh Sakhrani et.al. 2407.06093v1 null
2024-07-08 Assessing Cardiomegaly in Dogs Using a Simple CNN Model Nikhil Deekonda et.al. 2407.06092v1 null
2024-07-05 VCoME: Verbal Video Composition with Multimodal Editing Effects Weibo Gong et.al. 2407.04697v1 null
2024-07-05 Enhancing Vehicle Re-identification and Matching for Weaving Analysis Mei Qiu et.al. 2407.04688v1 null
2024-07-05 Embracing Massive Medical Data Yu-Cheng Chou et.al. 2407.04687v1 link
2024-07-05 Is plantar thermography a valid digital biomarker for characterising diabetic foot ulceration risk? Akshay Jagadeesh et.al. 2407.04676v1 null
2024-07-05 AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation Yuhan Zhu et.al. 2407.04603v1 null
2024-07-05 Multimodal Classification via Modal-Aware Interactive Enhancement Qing-Yuan Jiang et.al. 2407.04587v1 null
2024-07-05 A Degree Bound for Planar Functions Christof Beierle et.al. 2407.04570v1 null
2024-07-05 Pencils of plane cubics with one base point Riccardo Moschetti et.al. 2407.04569v1 null
2024-07-05 Anticipating Solar Flares Hugh S. Hudson et.al. 2407.04567v1 null
2024-07-05 Real Time Emotion Analysis Using Deep Learning for Education, Entertainment, and Beyond Abhilash Khuntia et.al. 2407.04560v1 null
2024-07-03 InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output Pan Zhang et.al. 2407.03320v1 link
2024-07-03 Value-Penalized Auxiliary Control from Examples for Learning without Rewards or Demonstrations Trevor Ablett et.al. 2407.03311v1 link
2024-07-03 Accelerated Proton Resonance Frequency-based Magnetic Resonance Thermometry by Optimized Deep Learning Method Sijie Xu et.al. 2407.03308v1 link
2024-07-03 HoloHisto: End-to-end Gigapixel WSI Segmentation with 4K Resolution Sequential Tokenization Yucheng Tang et.al. 2407.03307v1 null
2024-07-03 VCHAR:Variance-Driven Complex Human Activity Recognition framework with Generative Representation Yuan Sun et.al. 2407.03291v1 null
2024-07-03 Using Photoplethysmography to Detect Real-time Blood Pressure Changes with a Calibration-free Deep Learning Model Jingyuan Hong et.al. 2407.03274v1 null
2024-07-03 Modern Neighborhood Components Analysis: A Deep Tabular Baseline Two Decades Later Han-Jia Ye et.al. 2407.03257v1 link
2024-07-03 STF: Sentence Transformer Fine-Tuning For Topic Categorization With Limited Data Kheir Eddine Daouadi et.al. 2407.03253v1 null
2024-07-03 ACTRESS: Active Retraining for Semi-supervised Visual Grounding Weitai Kang et.al. 2407.03251v1 null
2024-07-04 TieBot: Learning to Knot a Tie from Visual Demonstration through a Real-to-Sim-to-Real Approach Weikun Peng et.al. 2407.03245v2 null
2024-07-02 Characterizing the Interpretability of Attention Maps in Digital Pathology Tomé Albuquerque et.al. 2407.02484v1 null
2024-07-02 Ensemble of pre-trained language models and data augmentation for hate speech detection from Arabic tweets Kheir Eddine Daouadi et.al. 2407.02448v1 null
2024-07-02 PLeaS -- Merging Models with Permutations and Least Squares Anshul Nasery et.al. 2407.02447v1 null
2024-07-02 Evaluating the Robustness of Adverse Drug Event Classification Models Using Templates Dorothea MacPhail et.al. 2407.02432v1 null
2024-07-02 AXIAL: Attention-based eXplainability for Interpretable Alzheimer's Localized Diagnosis using 2D CNNs on 3D MRI brain scans Gabriele Lozupone et.al. 2407.02418v1 link
2024-07-03 Video Watermarking: Safeguarding Your Video from (Unauthorized) Annotations by Video-based LLMs Jinmin Li et.al. 2407.02411v2 null
2024-07-02 Tiny-PULP-Dronets: Squeezing Neural Networks for Faster and Lighter Inference on Multi-Tasking Autonomous Nano-Drones Lorenzo Lamberti et.al. 2407.02405v1 null
2024-07-03 A neural networks method to search for long transient gravitational waves Francesca Attadio et.al. 2407.02391v2 null
2024-07-02 Real HSI-MSI-PAN image dataset for the hyperspectral/multi-spectral/panchromatic image fusion and super-resolution fields Shuangliang Li et.al. 2407.02387v1 link
2024-07-02 OpenSlot: Mixed Open-set Recognition with Object-centric Learning Xu Yin et.al. 2407.02386v1 null
2024-06-28 Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs Sukmin Yun et.al. 2406.20098v1 link
2024-06-28 LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression Jieneng Chen et.al. 2406.20092v1 link
2024-06-28 Minimax And Adaptive Transfer Learning for Nonparametric Classification under Distributed Differential Privacy Constraints Arnab Auddy et.al. 2406.20088v1 null
2024-06-28 Extreme horizon equation Wojciech Kamiński et.al. 2406.20068v1 null
2024-06-28 Modeling and LQR Control of Insect Sized Flapping Wing Robot Daksh Dhingra et.al. 2406.20061v1 null
2024-06-28 Pairwise Difference Learning for Classification Mohamed Karim Belaid et.al. 2406.20031v1 link
2024-06-28 On the Trade-off between Flatness and Optimization in Distributed Learning Ying Cao et.al. 2406.20006v1 null
2024-06-28 Malaria Cell Detection Using Deep Neural Networks Saurabh Sawant et.al. 2406.20005v1 null
2024-06-28 Impact of Initialization on Intra-subject Pediatric Brain MR Image Registration: A Comparative Analysis between SyN ANTs and Deep Learning-Based Approaches Andjela Dimitrijevic et.al. 2406.19943v1 link
2024-07-01 GRACE: Graph-Regularized Attentive Convolutional Entanglement with Laplacian Smoothing for Robust DeepFake Video Detection Chih-Chung Hsu et.al. 2406.19941v2 link
2024-06-27 ReXTime: A Benchmark Suite for Reasoning-Across-Time in Videos Jr-Jen Chen et.al. 2406.19392v1 link
2024-06-27 Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads Ali Khaleghi Rahimian et.al. 2406.19391v1 link
2024-06-27 OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding Tao Zhang et.al. 2406.19389v1 null
2024-06-27 Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model Haobo Yuan et.al. 2406.19369v1 null
2024-06-27 IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language Lucky Susanto et.al. 2406.19349v1 null
2024-06-27 Learning Visual Conditioning Tokens to Correct Domain Shift for Fully Test-time Adaptation Yushun Tang et.al. 2406.19341v1 null
2024-06-28 LiverUSRecon: Automatic 3D Reconstruction and Volumetry of the Liver with a Few Partial Ultrasound Scans Kaushalya Sivayogaraj et.al. 2406.19336v2 null
2024-06-27 PNeRV: A Polynomial Neural Representation for Videos Sonam Gupta et.al. 2406.19299v1 null
2024-06-27 Leveraging Contrastive Learning for Enhanced Node Representations in Tokenized Graph Transformers Jinsong Chen et.al. 2406.19258v1 null
2024-06-27 Enhancing Video-Language Representations with Structural Spatio-Temporal Alignment Hao Fei et.al. 2406.19255v1 null
2024-06-26 Towards Compositionality in Concept Learning Adam Stein et.al. 2406.18534v1 link
2024-06-26 MatchTime: Towards Automatic Soccer Game Commentary Generation Jiayuan Rao et.al. 2406.18530v1 null
2024-06-26 MultiDiff: Consistent Novel View Synthesis from a Single Image Norman Müller et.al. 2406.18524v1 null
2024-06-26 ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation Shenghai Yuan et.al. 2406.18522v1 null
2024-06-27 Distinguishing mechanisms of social contagion from local network view Elsa Andres et.al. 2406.18519v2 null
2024-06-26 Assessment of Clonal Hematopoiesis of Indeterminate Potential from Cardiac Magnetic Resonance Imaging using Deep Learning in a Cardio-oncology Population Sangeon Ryu et.al. 2406.18508v1 null
2024-06-26 Robust Surgical Phase Recognition From Annotation Efficient Supervision Or Rubin et.al. 2406.18481v1 null
2024-06-26 Universal Anomaly Detection at the LHC: Transforming Optimal Classifiers and the DDD Method Sascha Caron et.al. 2406.18469v1 null
2024-06-26 An Autotuning-based Optimization Framework for Mixed-kernel SVM Classifications in Smart Pixel Datasets and Heterojunction Transistors Xingfu Wu et.al. 2406.18445v1 null
2024-06-26 Repeat and Concatenate: 2D to 3D Image Translation with 3D to 3D Generative Modeling Abril Corona-Figueroa et.al. 2406.18422v1 null
2024-06-25 Text-Animator: Controllable Visual Text Video Generation Lin Liu et.al. 2406.17777v1 null
2024-06-25 MotionBooth: Motion-Aware Customized Text-to-Video Generation Jianzong Wu et.al. 2406.17758v1 null
2024-06-25 Benchmarking Deep Learning Models on NVIDIA Jetson Nano for Real-Time Systems: An Empirical Investigation Tushar Prasanna Swaminathan et.al. 2406.17749v1 null
2024-06-25 Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuning Arijit Sehanobish et.al. 2406.17740v1 null
2024-06-25 Mask-Guided Attention U-Net for Enhanced Neonatal Brain Extraction and Image Preprocessing Bahram Jafrasteh et.al. 2406.17709v1 link
2024-06-25 SurgeMOD: Translating image-space tissue motions into vision-based surgical forces Mikel De Iturrate Reyzabal et.al. 2406.17707v1 link
2024-06-25 Dualities for universal (co)acting Hopf monoids Ana Agore et.al. 2406.17684v1 null
2024-06-25 Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation Xuming Zhang et.al. 2406.17679v1 null
2024-06-25 Lifting of locally initial objects and universal (co)acting Hopf algebras Ana Agore et.al. 2406.17677v1 null
2024-06-25 Brain Tumor Classification using Vision Transformer with Selective Cross-Attention Mechanism and Feature Calibration Mohammad Ali Labbaf Khaniki et.al. 2406.17670v1 null
2024-06-24 StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal Chongjie Ye et.al. 2406.16864v1 null
2024-06-24 FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models Haonan Qiu et.al. 2406.16863v1 link
2024-06-24 Dreamitate: Real-World Visuomotor Policy Learning via Video Generation Junbang Liang et.al. 2406.16862v1 null
2024-06-24 Long Context Transfer from Language to Vision Peiyuan Zhang et.al. 2406.16852v1 link
2024-06-24 Unsupervised Domain Adaptation for Pediatric Brain Tumor Segmentation Jingru Fu et.al. 2406.16848v1 null
2024-06-24 Exploring Factual Entailment with NLI: A News Media Study Guy Mor-Lan et.al. 2406.16842v1 null
2024-06-24 A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking Lorenzo Shaikewitz et.al. 2406.16837v1 null
2024-06-24 USDC: A Dataset of $\underline{U}$ser $\underline{S}$tance and $\underline{D}$ogmatism in Long $\underline{C}$onversations Mounika Marreddy et.al. 2406.16833v1 null
2024-06-24 The classification of simple complex Lie superalgebras of polynomial vector fields and their deformations Dimitry Leites et.al. 2406.16760v1 null
2024-06-24 The MRI Scanner as a Diagnostic: Image-less Active Sampling Yuning Du et.al. 2406.16754v1 null
2024-06-21 Full-Scale Indexing and Semantic Annotation of CT Imaging: Boosting FAIRness Hannes Ulrich et.al. 2406.15340v1 null
2024-06-21 Image Conductor: Precision Control for Interactive Video Synthesis Yaowei Li et.al. 2406.15339v1 null
2024-06-21 An End-to-End, Segmentation-Free, Arabic Handwritten Recognition Model on KHATT Sondos Aabed et.al. 2406.15329v1 null
2024-06-21 Fine-grained Attention in Hierarchical Transformers for Tabular Time-series Raphael Azorin et.al. 2406.15327v1 link
2024-06-21 NLP-KG: A System for Exploratory Search of Scientific Literature in Natural Language Processing Tim Schopf et.al. 2406.15294v1 link
2024-06-21 Towards Fine-Grained Citation Evaluation in Generated Text: A Comparative Analysis of Faithfulness Metrics Weijia Zhang et.al. 2406.15264v1 null
2024-06-24 VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation Xuan He et.al. 2406.15252v2 null
2024-06-21 Retrieval Augmented Zero-Shot Text Classification Tassallah Abdullahi et.al. 2406.15241v1 null
2024-06-21 Model Equivalences Michael Benedikt et.al. 2406.15235v1 null
2024-06-21 Rate-Splitting Multiple Access for Overloaded Multi-group Multicast: A First Experimental Study Xinze Lyu et.al. 2406.15217v1 null
2024-06-20 A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models Xincheng Shuai et.al. 2406.14555v1 link
2024-06-21 Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation Eyal Michaeli et.al. 2406.14551v2 link
2024-06-20 IRASim: Learning Interactive Real-Robot Action Simulators Fangqi Zhu et.al. 2406.14540v1 null
2024-06-20 Epicardium Prompt-guided Real-time Cardiac Ultrasound Frame-to-volume Registration Long Lei et.al. 2406.14534v1 link
2024-06-20 Local symmetries in partially ordered sets Christoph Minz et.al. 2406.14533v1 null
2024-06-20 Fantastic Copyrighted Beasts and How (Not) to Generate Them Luxi He et.al. 2406.14526v1 null
2024-06-20 MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding Xinyu Fang et.al. 2406.14515v1 link
2024-06-20 V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data Rotem Shalev-Arkushin et.al. 2406.14510v1 null
2024-06-20 LLaSA: Large Multimodal Agent for Human Activity Analysis Through Wearable Sensors Sheikh Asif Imran et.al. 2406.14498v1 link
2024-06-20 African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object Classification Gregor Geigle et.al. 2406.14496v1 null
2024-06-18 DrVideo: Document Retrieval Based Long Video Understanding Ziyu Ma et.al. 2406.12846v1 null
2024-06-18 LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging Jinuk Kim et.al. 2406.12837v1 link
2024-06-18 GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation Ci-Siang Lin et.al. 2406.12834v1 null
2024-06-18 VIA: A Spatiotemporal Video Adaptation Framework for Global and Local Video Editing Jing Gu et.al. 2406.12831v1 null
2024-06-18 Neural Approximate Mirror Maps for Constrained Diffusion Models Berthy T. Feng et.al. 2406.12816v1 null
2024-06-18 Privacy Preserving Federated Learning in Medical Imaging with Uncertainty Estimation Nikolas Koutsoubis et.al. 2406.12815v1 link
2024-06-18 Probabilistic Temporal Prediction of Continuous Disease Trajectories and Treatment Effects Using Neural SDEs Joshua Durso-Finley et.al. 2406.12807v1 null
2024-06-18 Composited-Nested-Learning with Data Augmentation for Nested Named Entity Recognition Xingming Liao et.al. 2406.12779v1 null
2024-06-18 Medvedev degrees of subshifts on groups Sebastián Barbieri et.al. 2406.12777v1 null
2024-06-18 Latent Intuitive Physics: Learning to Transfer Hidden Physics from A 3D Video Xiangming Zhu et.al. 2406.12769v1 null
2024-06-17 Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99% Lei Zhu et.al. 2406.11837v1 link
2024-06-17 Spectral Introspection Identifies Group Training Dynamics in Deep Neural Networks for Neuroimaging Bradley T. Baker et.al. 2406.11825v1 null
2024-06-17 Infinigen Indoors: Photorealistic Indoor Scenes using Procedural Generation Alexander Raistrick et.al. 2406.11824v1 null
2024-06-17 VideoLLM-online: Online Video Large Language Model for Streaming Video Joya Chen et.al. 2406.11816v1 null
2024-06-17 Faces of Experimental Pain: Transferability of Deep Learned Heat Pain Features to Electrical Pain Pooja Prajod et.al. 2406.11808v1 null
2024-06-17 Mix-Domain Contrastive Learning for Unpaired H&E-to-IHC Stain Translation Song Wang et.al. 2406.11799v1 null
2024-06-17 CELL your Model: Contrastive Explanation Methods for Large Language Models Ronny Luss et.al. 2406.11785v1 null
2024-06-17 Task Me Anything Jieyu Zhang et.al. 2406.11775v1 link
2024-06-17 Domain Generalization for In-Orbit 6D Pose Estimation Antoine Legrand et.al. 2406.11743v1 null
2024-06-17 Lightweight Model Pre-training via Language Guided Knowledge Distillation Mingsheng Li et.al. 2406.11689v1 link
2024-06-14 VideoGUI: A Benchmark for GUI Automation from Instructional Videos Kevin Qinghong Lin et.al. 2406.10227v1 null
2024-06-14 Short Film Dataset (SFD): A Benchmark for Story-Level Video Understanding Ridouane Ghermi et.al. 2406.10221v1 null
2024-06-14 SSTFB: Leveraging self-supervised pretext learning and temporal self-attention with feature branching for real-time video polyp segmentation Ziang Xu et.al. 2406.10200v1 null
2024-06-14 CarLLaVA: Vision language models for camera-only closed-loop driving Katrin Renz et.al. 2406.10165v1 null
2024-06-14 Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition Guinan Li et.al. 2406.10152v1 null
2024-06-14 Training-free Camera Control for Video Generation Chen Hou et.al. 2406.10126v1 null
2024-06-14 Modified Risk Formulation for Improving the Prediction of Knee Osteoarthritis Progression Haresh Rengaraj Rajamohan et.al. 2406.10119v1 null
2024-06-14 ECGMamba: Towards Efficient ECG Classification with BiSSM Yupeng Qiang et.al. 2406.10098v1 null
2024-06-14 Biomarker based Cancer Classification using an Ensemble with Pre-trained Models Chongmin Lee et.al. 2406.10087v1 null
2024-06-14 On the Evaluation of Speech Foundation Models for Spoken Language Understanding Siddhant Arora et.al. 2406.10083v1 null
2024-06-13 VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding Muhammad Maaz et.al. 2406.09418v1 link
2024-06-13 An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels Duy-Kien Nguyen et.al. 2406.09415v1 null
2024-06-13 CodedEvents: Optimal Point-Spread-Function Engineering for 3D-Tracking with Event Cameras Sachin Shah et.al. 2406.09409v1 null
2024-06-13 Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion Linzhan Mou et.al. 2406.09402v1 null
2024-06-13 OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation Junke Wang et.al. 2406.09399v1 link
2024-06-13 Too Many Frames, not all Useful:Efficient Strategies for Long-Form Video QA Jongwoo Park et.al. 2406.09396v1 null
2024-06-13 LLAVIDAL: Benchmarking Large Language Vision Models for Daily Activities of Living Rajatsubhra Chakraborty et.al. 2406.09390v1 null
2024-06-13 Sagiri: Low Dynamic Range Image Enhancement with Generative Diffusion Prior Baiang Li et.al. 2406.09389v1 null
2024-06-13 Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition Youngtaek Oh et.al. 2406.09388v1 link
2024-06-13 SimGen: Simulator-conditioned Driving Scene Generation Yunsong Zhou et.al. 2406.09386v1 null
2024-06-12 On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models Hashmat Shadab Malik et.al. 2406.08486v1 link
2024-06-12 RMem: Restricted Memory Banks Improve Video Object Segmentation Junbao Zhou et.al. 2406.08476v1 null
2024-06-12 AToM-Bot: Embodied Fulfillment of Unspoken Human Needs with Affective Theory of Mind Wei Ding et.al. 2406.08455v1 null
2024-06-12 Transformation-Dependent Adversarial Attacks Yaoteng Tan et.al. 2406.08443v1 null
2024-06-12 A Sticker is Worth a Thousand Words: Characterizing the Use of Stickers in WhatsApp Political Groups in Brazil Philipe Melo et.al. 2406.08429v1 null
2024-06-12 Improving Noise Robustness through Abstractions and its Impact on Machine Learning Alfredo Ibias et.al. 2406.08428v1 null
2024-06-12 OmniCorpus: An Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text Qingyun Li et.al. 2406.08418v1 link
2024-06-13 MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos Xuehai He et.al. 2406.08407v2 link
2024-06-12 Eyes Wide Unshut: Unsupervised Mistake Detection in Egocentric Video by Detecting Unpredictable Gaze Michele Mazzamuto et.al. 2406.08379v1 null
2024-06-12 2.5D Multi-view Averaging Diffusion Model for 3D Medical Image Translation: Application to Low-count PET Reconstruction with CT-less Attenuation Correction Tianqi Chen et.al. 2406.08374v1 null
2024-06-11 Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring Huicong Zhang et.al. 2406.07551v1 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548v1 link
2024-06-11 Zero-shot Image Editing with Reference Imitation Xi Chen et.al. 2406.07547v1 null
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar et.al. 2406.07539v1 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538v1 null
2024-06-11 Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection Wenxiao Wang et.al. 2406.07536v1 null
2024-06-11 Dynamics of the non-radial energy-critical inhomogeneous NLS Carlos M. Guzmán et.al. 2406.07535v1 null
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515v1 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506v1 link
2024-06-10 NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing Ting-Hsuan Chen et.al. 2406.06523v1 null
2024-06-10 Data Augmentation for Multivariate Time Series Classification: An Experimental Study Romain Ilbert et.al. 2406.06518v1 null
2024-06-10 Merlin: A Vision Language Foundation Model for 3D Computed Tomography Louis Blankemeier et.al. 2406.06512v1 null
2024-06-10 Monkey See, Monkey Do: Harnessing Self-attention in Motion Diffusion for Zero-shot Motion Transfer Sigal Raab et.al. 2406.06508v1 link
2024-06-10 Equivariant Neural Tangent Kernels Philipp Misof et.al. 2406.06504v1 null
2024-06-10 Viscous shock fluctuations in KPZ Alexander Dunlap et.al. 2406.06502v1 null
2024-06-10 NarrativeBridge: Enhancing Video Captioning with Causal-Temporal Narrative Asmar Nadeem et.al. 2406.06499v1 null
2024-06-10 Demonstrating HumanTHOR: A Simulation Platform and Benchmark for Human-Robot Collaboration in a Shared Workspace Chenxu Wang et.al. 2406.06498v1 null
2024-06-10 Graph-Based Bidirectional Transformer Decision Threshold Adjustment Algorithm for Class-Imbalanced Molecular Data Nicole Hayes et.al. 2406.06479v1 null
2024-06-10 DiffAudit: Auditing Privacy Practices of Online Services for Children and Adolescents Olivia Figueira et.al. 2406.06473v1 null
2024-06-07 DVOS: Self-Supervised Dense-Pattern Video Object Segmentation Keyhan Najafian et.al. 2406.05131v1 null
2024-06-07 Compositional Curvature Bounds for Deep Neural Networks Taha Entesari et.al. 2406.05119v1 null
2024-06-07 Large Generative Graph Models Yu Wang et.al. 2406.05109v1 null
2024-06-07 A Novel Time Series-to-Image Encoding Approach for Weather Phenomena Classification Christian Giannetti et.al. 2406.05096v1 null
2024-06-10 Discovery of An Apparent Red, High-Velocity Type Ia Supernova at z = 2.9 with JWST J. D. R. Pierel et.al. 2406.05089v2 null
2024-06-07 CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion Xingrui Wang et.al. 2406.05082v1 null
2024-06-10 Discovery of a Relativistic Stripped Envelope Type Ic-BL Supernova at z = 2.83 with JWST M. R. Siebert et.al. 2406.05076v2 null
2024-06-07 Diving Deep into the Motion Representation of Video-Text Models Chinmaya Devaraj et.al. 2406.05075v1 null
2024-06-07 Hibou: A Family of Foundational Vision Transformers for Pathology Dmitry Nechaev et.al. 2406.05074v1 null
2024-06-07 Classification Metrics for Image Explanations: Towards Building Reliable XAI-Evaluations Benjamin Fresz et.al. 2406.05068v1 link
2024-06-06 Verbalized Machine Learning: Revisiting Machine Learning with Language Models Tim Z. Xiao et.al. 2406.04344v1 null
2024-06-07 Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion Fangfu Liu et.al. 2406.04338v2 null
2024-06-06 Parameter-Inverted Image Pyramid Networks Xizhou Zhu et.al. 2406.04330v1 link
2024-06-06 ShareGPT4Video: Improving Video Understanding and Generation with Better Captions Lin Chen et.al. 2406.04325v1 null
2024-06-06 SF-V: Single Forward Video Generation Model Zhixing Zhang et.al. 2406.04324v1 null
2024-06-06 ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories Qianlan Yang et.al. 2406.04323v1 null
2024-06-06 VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling Zeyue Tian et.al. 2406.04321v1 link
2024-06-06 Chimera: Effectively Modeling Multivariate Time Series with 2-Dimensional State Space Models Ali Behrouz et.al. 2406.04320v1 null
2024-06-06 Adaptive Sampling of k-Space in Magnetic Resonance for Rapid Pathology Prediction Chen-Yu Yen et.al. 2406.04318v1 null
2024-06-06 Regularized KL-Divergence for Well-Defined Function-Space Variational Inference in Bayesian neural networks Tristan Cinquin et.al. 2406.04317v1 null
2024-06-05 Grokking Modular Polynomials Darshil Doshi et.al. 2406.03495v1 null
2024-06-05 The Logarithmic Memristor-Based Bayesian Machine Clément Turck et.al. 2406.03492v1 null
2024-06-05 Convolutional Neural Networks and Vision Transformers for Fashion MNIST Classification: A Literature Review Sonia Bbouzidi et.al. 2406.03478v1 null
2024-06-05 Node-wise Filtering in Graph Neural Networks: A Mixture of Experts Approach Haoyu Han et.al. 2406.03464v1 null
2024-06-05 Polarization Wavefront Lidar: Learning Large Scene Reconstruction from Polarized Wavefronts Dominik Scheuble et.al. 2406.03461v1 null
2024-06-05 FILS: Self-Supervised Video Feature Prediction In Semantic Language Space Mona Ahmadian et.al. 2406.03447v1 null
2024-06-05 Text-to-Events: Synthetic Event Camera Streams from Conditional Text Input Joachim Ott et.al. 2406.03439v1 null
2024-06-05 Stabilizing massless fields with fluxes in Landau-Ginzburg models Katrin Becker et.al. 2406.03435v1 null
2024-06-05 Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis Moein Heidari et.al. 2406.03430v1 link
2024-06-05 Post-hoc Part-prototype Networks Andong Tan et.al. 2406.03421v1 null
2024-06-05 Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting Inkyu Shin et.al. 2406.02541v2 null
2024-06-04 ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation Tianchen Zhao et.al. 2406.02540v1 null
2024-06-04 Enhancing predictive imaging biomarker discovery through treatment effect analysis Shuhan Xiao et.al. 2406.02534v1 null
2024-06-04 ReLUs Are Sufficient for Learning Implicit Neural Representations Joseph Shenouda et.al. 2406.02529v1 link
2024-06-04 RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots Soroush Nasiriany et.al. 2406.02523v1 null
2024-06-04 DDGS-CT: Direction-Disentangled Gaussian Splatting for Realistic Volume Rendering Zhongpai Gao et.al. 2406.02518v1 null
2024-06-04 V-Express: Conditional Dropout for Progressive Training of Portrait Video Generation Cong Wang et.al. 2406.02511v1 null
2024-06-04 CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation Dejia Xu et.al. 2406.02509v1 null
2024-06-04 Endomorphisms of Artin groups of type $\tilde A_n$ Luis Paris et.al. 2406.02484v1 null
2024-06-04 Inpainting Pathology in Lumbar Spine MRI with Latent Diffusion Colin Hansen et.al. 2406.02477v1 null
2024-05-31 Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis Chaoyou Fu et.al. 2405.21075v1 null
2024-05-31 Generalization Beyond Data Imbalance: A Controlled Study on CLIP for Transferable Insights Xin Wen et.al. 2405.21070v1 link
2024-05-31 You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet Zhen Qin et.al. 2405.21022v1 null
2024-05-31 Beyond Conventional Parametric Modeling: Data-Driven Framework for Estimation and Prediction of Time Activity Curves in Dynamic PET Imaging Niloufar Zakariaei et.al. 2405.21021v1 null
2024-05-31 The classification of dp-minimal integral domains Christian d'Elbée et.al. 2405.21014v1 null
2024-05-31 Early Stopping Criteria for Training Generative Adversarial Networks in Biomedical Imaging Muhammad Muneeb Saad et.al. 2405.20987v1 null
2024-05-31 PUAL: A Classifier on Trifurcate Positive-Unlabeled Data Xiaoke Wang et.al. 2405.20970v1 null
2024-05-31 Aligning Multiclass Neural Network Classifier Criterion with Task Performance via $F_β$-Score Nathan Tsoi et.al. 2405.20954v1 null
2024-05-31 Standard model of electromagnetism and chirality in crystals R. Winkler et.al. 2405.20940v1 null
2024-05-31 MALT: Multi-scale Action Learning Transformer for Online Action Detection Zhipeng Yang et.al. 2405.20892v1 null
2024-05-30 MotionLLM: Understanding Human Behaviors from Human Motions and Videos Ling-Hao Chen et.al. 2405.20340v1 null
2024-05-30 OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving Lening Wang et.al. 2405.20337v1 link
2024-05-30 VividDream: Generating 3D Scene with Ambient Dynamics Yao-Chih Lee et.al. 2405.20334v1 null
2024-05-30 SurgiTrack: Fine-Grained Multi-Class Multi-Tool Tracking in Surgical Videos Chinedu Innocent Nwoye et.al. 2405.20333v1 null
2024-05-31 4DHands: Reconstructing Interactive Hands in 4D with Transformers Dixuan Lin et.al. 2405.20330v2 null
2024-05-30 MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion Shuyuan Tu et.al. 2405.20325v1 null
2024-05-30 Vision-based Manipulation from Single Human Video with Open-World Object Graphs Yifeng Zhu et.al. 2405.20321v1 null
2024-05-30 Improving the Training of Rectified Flows Sangyun Lee et.al. 2405.20320v1 link
2024-05-30 CausalQuest: Collecting Natural Causal Questions for AI Agents Roberto Ceraolo et.al. 2405.20318v1 link
2024-05-30 Can't make an Omelette without Breaking some Eggs: Plausible Action Anticipation using Large Video-Language Models Himangi Mittal et.al. 2405.20305v1 null
2024-05-29 X-VILA: Cross-Modality Alignment for Large Language Model Hanrong Ye et.al. 2405.19335v1 null
2024-05-29 LLMs Meet Multimodal Generation and Editing: A Survey Yingqing He et.al. 2405.19334v1 link
2024-05-29 Multi-Modal Generative Embedding Model Feipeng Ma et.al. 2405.19333v1 null
2024-05-29 NPGA: Neural Parametric Gaussian Avatars Simon Giebenhain et.al. 2405.19331v1 null
2024-05-29 Normative Modules: A Generative Agent Architecture for Learning Norms that Supports Multi-Agent Cooperation Atrisha Sarkar et.al. 2405.19328v1 null
2024-05-29 DGD: Dynamic 3D Gaussians Distillation Isaac Labe et.al. 2405.19321v1 null
2024-05-29 Real-Time Environment Condition Classification for Autonomous Vehicles Marco Introvigne et.al. 2405.19305v1 null
2024-05-29 Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare Hanwei Zhu et.al. 2405.19298v1 null
2024-05-29 Archetype-Based Redshift Estimation for the Dark Energy Spectroscopic Instrument Survey Abhijeet Anand et.al. 2405.19288v1 null
2024-05-29 A study on the adequacy of common IQA measures for medical images Anna Breger et.al. 2405.19224v1 null
2024-05-28 Classifying Overlapping Gaussian Mixtures in High Dimensions: From Optimal Classifiers to Neural Nets Khen Cohen et.al. 2405.18427v1 null
2024-05-28 GFlow: Recovering 4D World from Monocular Video Shizun Wang et.al. 2405.18426v1 null
2024-05-28 Hierarchical World Models as Visual Whole-Body Humanoid Controllers Nicklas Hansen et.al. 2405.18418v1 null
2024-05-28 3D StreetUnveiler with Semantic-Aware 2DGS Jingwei Xu et.al. 2405.18416v1 null
2024-05-28 Why are Visually-Grounded Language Models Bad at Image Classification? Yuhui Zhang et.al. 2405.18415v1 link
2024-05-28 Towards a Sampling Theory for Implicit Neural Representations Mahrokh Najaf et.al. 2405.18410v1 null
2024-05-28 Phased Consistency Model Fu-Yun Wang et.al. 2405.18407v1 null
2024-05-28 RACCooN: Remove, Add, and Change Video Content with Auto-Generated Narratives Jaehong Yoon et.al. 2405.18406v1 null
2024-05-28 MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning Somnath Kumar et.al. 2405.18358v1 null
2024-05-28 Universal and Extensible Language-Vision Models for Organ Segmentation and Tumor Detection from Abdominal Computed Tomography Jie Liu et.al. 2405.18356v1 link
2024-05-27 Matryoshka Multimodal Models Mu Cai et.al. 2405.17430v1 null
2024-05-27 NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models Chankyu Lee et.al. 2405.17428v1 null
2024-05-27 MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion Scaffolds Jiahui Lei et.al. 2405.17421v1 null
2024-05-27 Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control Zhengfei Kuang et.al. 2405.17414v1 null
2024-05-27 Enhancing Music Genre Classification through Multi-Algorithm Analysis and User-Friendly Visualization Navin Kamuni et.al. 2405.17413v1 null
2024-05-27 The Peripatetic Hater: Predicting Movement Among Hate Subreddits Daniel Hickey et.al. 2405.17410v1 null
2024-05-27 Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer Ruizhi Shao et.al. 2405.17405v1 null
2024-05-27 Spectral Greedy Coresets for Graph Neural Networks Mucong Ding et.al. 2405.17404v1 null
2024-05-27 Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability Shenyuan Gao et.al. 2405.17398v1 link
2024-05-27 Non-Unitary Quantum Machine Learning Jamie Heredge et.al. 2405.17388v1 null
2024-05-24 Canonical Variates in Wasserstein Metric Space Jia Li et.al. 2405.15768v1 null
2024-05-24 Scaling Laws for Discriminative Classification in Large Language Models Dean Wyatte et.al. 2405.15765v1 null
2024-05-24 InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation Yuchi Wang et.al. 2405.15758v1 link
2024-05-24 Looking Backward: Streaming Video-to-Video Translation with Feature Banks Feng Liang et.al. 2405.15757v1 link
2024-05-24 Characterizing Discourse Group Roles in Inquiry-based University Science Labs Tong Wan et.al. 2405.15746v1 null
2024-05-24 Hierarchical Uncertainty Exploration via Feedforward Posterior Trees Elias Nehme et.al. 2405.15719v1 null
2024-05-24 EmpathicStories++: A Multimodal Dataset for Empathy towards Personal Experiences Jocelyn Shen et.al. 2405.15708v1 null
2024-05-24 Sums: Sniffing Unknown Multiband Signals under Low Sampling Rates Jinbo Peng et.al. 2405.15705v1 null
2024-05-24 realSEUDO for real-time calcium imaging analysis Iuliia Dmitrieva et.al. 2405.15701v1 null
2024-05-24 UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes Ted Lentsch et.al. 2405.15688v1 null
2024-05-23 PuzzleAvatar: Assembling 3D Avatars from Personal Albums Yuliang Xiu et.al. 2405.14869v1 null
2024-05-23 Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis Basile Van Hoorick et.al. 2405.14868v1 null
2024-05-23 Video Diffusion Models are Training-free Motion Interpreter and Controller Zeqi Xiao et.al. 2405.14864v1 null
2024-05-23 Synergistic Global-space Camera and Human Reconstruction from Videos Yizhou Zhao et.al. 2405.14855v1 null
2024-05-23 Domain Wall Magnetic Tunnel Junction Reliable Integrate and Fire Neuron Can Cui1 et.al. 2405.14851v1 null
2024-05-23 Learning to Detect and Segment Mobile Objects from Unlabeled Videos Yihong Sun et.al. 2405.14841v1 null
2024-05-23 Designing A Sustainable Marine Debris Clean-up Framework without Human Labels Raymond Wang et.al. 2405.14815v1 null
2024-05-23 As an AI Language Model, "Yes I Would Recommend Calling the Police'': Norm Inconsistency in LLM Decision-Making Shomik Jain et.al. 2405.14812v1 null
2024-05-23 Lorentz-Equivariant Geometric Algebra Transformers for High-Energy Physics Jonas Spinner et.al. 2405.14806v1 null
2024-05-24 Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation Hongxu Jiang et.al. 2405.14802v2 link
2024-05-21 Comprehensive Multimodal Deep Learning Survival Prediction Enabled by a Transformer Architecture: A Multicenter Study in Glioblastoma Ahmed Gomaa et.al. 2405.12963v1 null
2024-05-21 Online Learning of Halfspaces with Massart Noise Ilias Diakonikolas et.al. 2405.12958v1 null
2024-05-21 Quantifying Uncertainty in Classification Performance: ROC Confidence Bands Using Conformal Prediction Zheshi Zheng et.al. 2405.12953v1 null
2024-05-21 Tutorly: Turning Programming Videos Into Apprenticeship Learning Environments with LLMs Wengxi Li et.al. 2405.12946v1 null
2024-05-21 Pytorch-Wildlife: A Collaborative Deep Learning Framework for Conservation Andres Hernandez et.al. 2405.12930v1 link
2024-05-21 Streamlining Software Reviews: Efficient Predictive Modeling with Minimal Examples Tim Menzies et.al. 2405.12920v1 null
2024-05-21 The $L_p$-dual space of a semisimple Lie group Bachir Bekka et.al. 2405.12919v1 null
2024-05-21 Topic Modelling Case Law Using a Large Language Model and a New Taxonomy for UK Law: AI Insights into Summary Judgment Holli Sargeant et.al. 2405.12910v1 link
2024-05-21 Decentralized Federated Learning Over Imperfect Communication Channels Weicai Li et.al. 2405.12894v1 null
2024-05-21 Investigating Persuasion Techniques in Arabic: An Empirical Study Leveraging Large Language Models Abdurahmman Alzahrani et.al. 2405.12884v1 null
2024-05-20 Images that Sound: Composing Images and Sounds on a Single Canvas Ziyang Chen et.al. 2405.12221v1 null
2024-05-20 Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices Nathaniel Cohen et.al. 2405.12211v1 null
2024-05-20 The sign of scalar curvature on Kähler blowups Garrett M. Brown et.al. 2405.12189v1 null
2024-05-20 Building Temporal Kernels with Orthogonal Polynomials Yan Ru Pei et.al. 2405.12179v1 link
2024-05-20 Wireless vs. Traditional Ultrasound Assessed Knee Cartilage Outcomes Utilizing Automated Gain and Normalization Techniques Arjun Parmar et.al. 2405.12172v1 null
2024-05-20 DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM Xuchen Li et.al. 2405.12139v1 null
2024-05-20 Alzheimer's Magnetic Resonance Imaging Classification Using Deep and Meta-Learning Models Nida Nasir et.al. 2405.12126v1 null
2024-05-20 An Active Learning Framework with a Class Balancing Strategy for Time Series Classification Shemonto Das et.al. 2405.12122v1 null
2024-05-20 AGNfitter-rx: Modelling the radio-to-X-ray SEDs of AGNs L. N. Martínez-Ramírez et.al. 2405.12111v1 null
2024-05-20 Real topological phonons in 3D carbon allotropes Xiaotian Wang et.al. 2405.12072v1 null
2024-05-17 Submodular Information Selection for Hypothesis Testing with Misclassification Penalties Jayanth Bhargav et.al. 2405.10930v1 null
2024-05-17 A Versatile Framework for Analyzing Galaxy Image Data by Implanting Human-in-the-loop on a Large Vision Model Mingxiang Fu et.al. 2405.10890v1 null
2024-05-17 Multicenter Privacy-Preserving Model Training for Deep Learning Brain Metastases Autosegmentation Yixing Huang et.al. 2405.10870v1 null
2024-05-17 "Hall" transport of liquid crystal solitons in Couette flow Rodrigo C. V. Coelho et.al. 2405.10850v1 null
2024-05-17 Automatic segmentation of Organs at Risk in Head and Neck cancer patients from CT and MRI scans Sébastien Quetin et.al. 2405.10833v1 null
2024-05-17 Open-Vocabulary Spatio-Temporal Action Detection Tao Wu et.al. 2405.10832v1 null
2024-05-17 Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities Hao Zhou et.al. 2405.10825v1 null
2024-05-17 ActiveLLM: Large Language Model-based Active Learning for Textual Few-Shot Scenarios Markus Bayer et.al. 2405.10808v1 null
2024-05-17 A Large-scale Multi Domain Leukemia Dataset for the White Blood Cells Detection with Morphological Attributes for Explainability Abdul Rehman et.al. 2405.10803v1 null
2024-05-17 Reduced storage direct tensor ring decomposition for convolutional neural networks compression Mateusz Gabor et.al. 2405.10802v1 link
2024-05-16 TRANSIC: Sim-to-Real Policy Transfer by Learning from Online Correction Yunfan Jiang et.al. 2405.10315v1 null
2024-05-16 4D Panoptic Scene Graph Generation Jingkang Yang et.al. 2405.10305v1 link
2024-05-16 On Sample Selection for Continual Learning: a Video Streaming Case Study Alexander Dietmüller et.al. 2405.10290v1 null
2024-05-16 Quantum Vision Transformers for Quark-Gluon Classification Marçal Comajoan Cara et.al. 2405.10284v1 null
2024-05-16 Faces that Speak: Jointly Synthesising Talking Face and Speech from Text Youngjoon Jang et.al. 2405.10272v1 null
2024-05-16 A Tale of Two Languages: Large-Vocabulary Continuous Sign Language Recognition from Spoken Language Supervision Charles Raude et.al. 2405.10266v1 null
2024-05-16 PRISM: A Multi-Modal Generative Foundation Model for Slide-Level Histopathology George Shaikovski et.al. 2405.10254v1 null
2024-05-16 A Foundation Model for Brain Lesion Segmentation with Mixture of Modality Experts Xinru Zhang et.al. 2405.10246v1 null
2024-05-16 Ternary mappings of some evolution algebras Candido Martin Gonzalez et.al. 2405.10241v1 null
2024-05-16 ENADPool: The Edge-Node Attention-based Differentiable Pooling for Graph Neural Networks Zhehan Zhao et.al. 2405.10218v1 null
2024-05-15 Classifying geospatial objects from multiview aerial imagery using semantic meshes David Russell et.al. 2405.09544v1 null
2024-05-15 Spectral complexity of deep neural networks Simmaco Di Lillo et.al. 2405.09541v1 null
2024-05-16 MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer Chengyu Wu et.al. 2405.09539v2 link
2024-05-15 Restoring balance: principled under/oversampling of data for optimal classification Emanuele Loffredo et.al. 2405.09535v1 null
2024-05-15 Tackling Distribution Shifts in Task-Oriented Communication with Information Bottleneck Hongru Li et.al. 2405.09514v1 null
2024-05-15 Beyond Flesch-Kincaid: Prompt-based Metrics Improve Difficulty Classification of Educational Texts Donya Rooein et.al. 2405.09482v1 null
2024-05-15 Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment Xinying Lin et.al. 2405.09472v1 null
2024-05-15 Non-contact Lung Disease Classification via OFDM-based Passive 6G ISAC Sensing Hasan Mujtaba Buttar et.al. 2405.09458v1 null
2024-05-15 Cohomogeneity one RCD-spaces Diego Corro et.al. 2405.09448v1 null
2024-05-15 M$^4$oE: A Foundation Model for Medical Multimodal Image Segmentation with Mixture of Experts Yufeng Jiang et.al. 2405.09446v1 null
2024-05-14 CinePile: A Long Video Question Answering Dataset and Benchmark Ruchit Rawal et.al. 2405.08813v1 null
2024-05-14 The Developing Human Connectome Project: A Fast Deep Learning-based Pipeline for Neonatal Cortical Surface Reconstruction Qiang Ma et.al. 2405.08783v1 null
2024-05-14 Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling Gregory Holste et.al. 2405.08780v1 null
2024-05-14 FolkTalent: Enhancing Classification and Tagging of Indian Folk Paintings Nancy Hada et.al. 2405.08776v1 null
2024-05-14 From Text to Context: An Entailment Approach for News Stakeholder Classification Alapan Kuila et.al. 2405.08751v1 null
2024-05-14 Enhancing Blind Video Quality Assessment with Rich Quality-aware Features Wei Sun et.al. 2405.08745v1 null
2024-05-14 The impact of Compositionality in Zero-shot Multi-label action recognition for Object-based tasks Carmela Calabrese et.al. 2405.08695v1 null
2024-05-14 Latent group structure in linear panel data models with endogenous regressors Junho Choi et.al. 2405.08687v1 null
2024-05-14 Achieving Fairness Through Channel Pruning for Dermatological Disease Diagnosis Qingpeng Kong et.al. 2405.08681v1 link
2024-05-14 Investigating Design Choices in Joint-Embedding Predictive Architectures for General Audio Representation Learning Alain Riou et.al. 2405.08679v1 null
2024-05-14 MambaOut: Do We Really Need Mamba for Vision? Weihao Yu et.al. 2405.07992v2 link
2024-05-13 SPIN: Simultaneous Perception, Interaction and Navigation Shagun Uppal et.al. 2405.07991v1 null
2024-05-13 KG-Planner: Knowledge-Informed Graph Neural Planning for Collaborative Manipulators Wansong Liu et.al. 2405.07962v1 null
2024-05-13 An Algorithmic Classification of Generalized Pseudo-Anosov Homeomorphisms via Geometric Markov Partitions Inti Cruz Diaz et.al. 2405.07954v1 null
2024-05-13 Scene Action Maps: Behavioural Maps for Navigation without Metric Information Joel Loo et.al. 2405.07948v1 null
2024-05-14 PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition Ziyang Zhang et.al. 2405.07932v2 link
2024-05-13 Improving Multimodal Learning with Multi-Loss Gradient Modulation Konstantinos Kontras et.al. 2405.07930v1 null
2024-05-13 PLUTO: Pathology-Universal Transformer Dinkar Juyal et.al. 2405.07905v1 null
2024-05-13 Enhancing Clinically Significant Prostate Cancer Prediction in T2-weighted Images through Transfer Learning from Breast Cancer Chi-en Amy Tai et.al. 2405.07869v1 null
2024-05-13 Improving Breast Cancer Grade Prediction with Multiparametric MRI Created Using Optimized Synthetic Correlated Diffusion Imaging Chi-en Amy Tai et.al. 2405.07861v1 null
2024-05-10 Multi-Object Tracking in the Dark Xinzhe Wang et.al. 2405.06600v1 link
2024-05-10 Ice phase classification made easy with score-based denoising Hong Sun et.al. 2405.06599v1 null
2024-05-10 Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach Elham Ravanbakhsh et.al. 2405.06586v1 null
2024-05-10 Deep video representation learning: a survey Elham Ravanbakhsh et.al. 2405.06574v1 null
2024-05-10 The Role of Topological Photon Spheres in Constraining the Parameters of Black Holes Jafar Sadeghi et.al. 2405.06568v1 null
2024-05-10 OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation Jinwei Lin et.al. 2405.06547v1 link
2024-05-10 Separating States in Astronomical Sources Using Hidden Markov Models: With a Case Study of Flaring and Quiescence on EV Lac Robert Zimmerman et.al. 2405.06540v1 null
2024-05-10 Semantic and Spatial Adaptive Pixel-level Classifier for Semantic Segmentation Xiaowen Ma et.al. 2405.06525v1 link
2024-05-10 Aspect-based Sentiment Evaluation of Chess Moves (ASSESS): an NLP-based Method for Evaluating Chess Strategies from Textbooks Haifa Alrdahi et.al. 2405.06499v1 null
2024-05-10 Improving Deep Learning Model Calibration for Cardiac Applications using Deterministic Uncertainty Networks and Uncertainty-aware Training Tareen Dawood et.al. 2405.06487v1 null
2024-05-09 A Universal Growth Rate for Learning with Smooth Surrogate Losses Anqi Mao et.al. 2405.05968v1 null
2024-05-09 Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask Zineb Senane et.al. 2405.05959v1 link
2024-05-09 Frame Interpolation with Consecutive Brownian Bridge Diffusion Zonglin Lyu et.al. 2405.05953v1 null
2024-05-09 Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers Peng Gao et.al. 2405.05945v1 link
2024-05-09 MRISegmentator-Abdomen: A Fully Automated Multi-Organ and Structure Segmentation Tool for T1-weighted Abdominal MRI Yan Zhuang et.al. 2405.05944v1 null
2024-05-09 Non-symplectic automorphisms of prime order of O'Grady's tenfolds and cubic fourfolds Simone Billi et.al. 2405.05932v1 null
2024-05-09 Deep Multi-Task Learning for Malware Image Classification Ahmed Bensaoud et.al. 2405.05906v1 null
2024-05-09 An RNN-policy gradient approach for quantum architecture search Gang Wang et.al. 2405.05892v1 null
2024-05-09 Composable Part-Based Manipulation Weiyu Liu et.al. 2405.05876v1 null
2024-05-09 ExACT: An End-to-End Autonomous Excavator System Using Action Chunking With Transformers Liangliang Chen et.al. 2405.05861v1 null
2024-05-08 Diffusion-HMC: Parameter Inference with Diffusion Model driven Hamiltonian Monte Carlo Nayantara Mudur et.al. 2405.05255v1 link
2024-05-08 Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models Hongjie Wang et.al. 2405.05252v1 null
2024-05-08 DanceCam: atmospheric turbulence mitigation in wide-field astronomical images with short-exposure video streams Spencer Bialek et.al. 2405.05250v1 null
2024-05-08 Deep learning-based variational autoencoder for classification of quantum and classical states of light Mahesh Bhupati et.al. 2405.05243v1 null
2024-05-08 On $\operatorname{Alt}(n)$-modules with an additive dimension when $n\le6$ Barry Chin et.al. 2405.05230v1 null
2024-05-08 Are Economically Advanced Countries More Efficient in Basic and Applied Research? Vladimír Holý et.al. 2405.05227v1 null
2024-05-08 Clustering Retail Products Based on Customer Behaviour Vladimír Holý et.al. 2405.05218v1 null
2024-05-08 FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models Jinglin Xu et.al. 2405.05216v1 link
2024-05-08 Graded Relevance Scoring of Written Essays with Dense Retrieval Salam Albatarni et.al. 2405.05200v1 null
2024-05-08 Is Transductive Learning Equivalent to PAC Learning? Shaddin Dughmi et.al. 2405.05190v1 null
2024-05-07 Switchable Decision: Dynamic Neural Generation Networks Shujian Zhang et.al. 2405.04513v1 null
2024-05-07 Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video Motion Editing Yi Zuo et.al. 2405.04496v1 null
2024-05-07 Exploration of Novel Neuromorphic Methodologies for Materials Applications Derek Gobin et.al. 2405.04478v1 null
2024-05-07 Generalized classical Yang-Baxter equation and regular decompositions Raschid Abedin et.al. 2405.04440v1 null
2024-05-07 On the classification of product-quotient surfaces with $q=0$, $p_g=3$ and their canonical map Federico Fallucca et.al. 2405.04425v1 null
2024-05-07 Vision Mamba: A Comprehensive Survey and Taxonomy Xiao Liu et.al. 2405.04404v1 link
2024-05-07 Efficient Online Set-valued Classification with Bandit Feedback Zhou Wang et.al. 2405.04393v1 null
2024-05-07 DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving Chen Min et.al. 2405.04390v1 null
2024-05-07 Parallelized Multi-Agent Bayesian Optimization in Lava Shay Snyder et.al. 2405.04387v1 null
2024-05-07 Pragmatist Intelligence: Where the Principle of Usefulness Can Take ANNs Antonio Bikić et.al. 2405.04386v1 null
2024-05-06 Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs Muhammad Uzair Khattak et.al. 2405.03690v1 null
2024-05-06 All-in-One Deep Learning Framework for MR Image Reconstruction Geunu Jeong et.al. 2405.03684v1 null
2024-05-06 ScrewMimic: Bimanual Imitation from Human Videos with Screw Space Projection Arpit Bahety et.al. 2405.03666v1 null
2024-05-06 CICA: Content-Injected Contrastive Alignment for Zero-Shot Document Image Classification Sankalp Sinha et.al. 2405.03660v1 null
2024-05-06 Collecting Consistently High Quality Object Tracks with Minimal Human Involvement by Using Self-Supervised Learning to Detect Tracker Errors Samreen Anjum et.al. 2405.03643v1 null
2024-05-06 Classification of Breast Cancer Histopathology Images using a Modified Supervised Contrastive Learning Method Matina Mahdizadeh Sani et.al. 2405.03642v1 link
2024-05-06 Nonequilibrium relaxation and odd-even effect in finite-temperature electron gases Eric Nilsson et.al. 2405.03635v1 null
2024-05-06 Nonnegative Matrix Factorization in Dimensionality Reduction: A Survey Farid Saberi-Movahed et.al. 2405.03615v1 null
2024-05-06 Dual Relation Mining Network for Zero-Shot Learning Jinwei Han et.al. 2405.03613v1 null
2024-05-06 Communities for the Lagrangian Dynamics of the Turbulent Velocity Gradient Tensor: A Network Participation Approach Christopher J. Keylock et.al. 2405.03589v1 null
2024-05-03 DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos Wen-Hsuan Chu et.al. 2405.02280v1 null
2024-05-03 Transversely Projective Structures on Smooth Foliations on Surfaces Gabriel Fazoli et.al. 2405.02273v1 null
2024-05-03 On its way to the neutron star-white dwarf binary graveyard, IGR J16194-2810, a first ascent M giant X-ray binary K. H. Hinkle et.al. 2405.02270v1 null
2024-05-03 Validating Gaia DR3 Pulsating Variable Classifications with TESS: Building Reliable $δ$ Scuti and $γ$ Doradus Stars Catalogs (In Progress) Ai-Ying Zhou et.al. 2405.02264v1 null
2024-05-03 Subgraph2vec: A random walk-based algorithm for embedding knowledge graphs Elika Bozorgi et.al. 2405.02240v1 null
2024-05-03 Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks Lujing Zhang et.al. 2405.02225v1 null
2024-05-03 Designed Dithering Sign Activation for Binary Neural Networks Brayan Monroy et.al. 2405.02220v1 null
2024-05-03 Multispectral Fine-Grained Classification of Blackgrass in Wheat and Barley Crops Madeleine Darbyshire et.al. 2405.02218v1 null
2024-05-03 Non-Destructive Peat Analysis using Hyperspectral Imaging and Machine Learning Yijun Yan et.al. 2405.02191v1 null
2024-05-03 Hoaxpedia: A Unified Wikipedia Hoax Articles Dataset Hsuvas Borkakoty et.al. 2405.02175v1 null
2024-05-02 Confronting sparse Gaia DR3 photometry with TESS for a sample of about 60,000 hot massive non-radial pulsators Daniel Hey et.al. 2405.01539v1 null
2024-05-02 Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks Murtaza Dalal et.al. 2405.01534v1 null
2024-05-02 Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models Nishad Singhi et.al. 2405.01531v1 null
2024-05-02 Track2Act: Predicting Point Tracks from Internet Videos enables Diverse Zero-shot Robot Manipulation Homanga Bharadhwaj et.al. 2405.01527v1 null
2024-05-03 A separability-based approach to quantifying generalization: which layer is best? Luciano Dyballa et.al. 2405.01524v2 null
2024-05-02 Grand Design vs. Multi-Armed Spiral Galaxies: Dependence on Galaxy Structure Beverly J. Smith et.al. 2405.01516v1 null
2024-05-03 Accelerating Convergence in Bayesian Few-Shot Classification Tianjun Ke et.al. 2405.01507v2 link
2024-05-02 PAM-UNet: Shifting Attention on Region of Interest in Medical Images Abhijit Das et.al. 2405.01503v1 null
2024-05-02 Exploring Privacy Issues in Mission Critical Communication: Navigating 5G and Beyond Networks Prajnamaya Dass et.al. 2405.01492v1 null
2024-05-02 Designing Algorithmic Recommendations to Achieve Human-AI Complementarity Bryce McLaughlin et.al. 2405.01484v1 null
2024-05-01 Quantum algorithms for matrix geometric means Nana Liu et.al. 2405.00673v1 null
2024-05-01 Adapting Pretrained Networks for Image Quality Assessment on High Dynamic Range Displays Andrei Chubarau et.al. 2405.00670v1 null
2024-05-01 Screening of BindingDB database ligands against EGFR, HER2, Estrogen, Progesterone and NF-kB receptors based on machine learning and molecular docking Parham Rezaee et.al. 2405.00647v1 null
2024-05-01 Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling Yida Mu et.al. 2405.00611v1 null
2024-05-01 Investigating Automatic Scoring and Feedback using Large Language Models Gloria Ashiya Katuka et.al. 2405.00602v1 null
2024-05-01 Discovering robust biomarkers of neurological disorders from functional MRI using graph neural networks: A Review Yi Hao Chan et.al. 2405.00577v1 null
2024-05-01 EALD-MLLM: Emotion Analysis in Long-sequential and De-identity videos with Multi-modal Large Language Model Deng Li et.al. 2405.00574v1 null
2024-05-01 Remote Sensing Data Assimilation with a Chained Hydrologic-hydraulic Model for Flood Forecasting Thanh Huy Nguyen et.al. 2405.00567v1 null
2024-05-01 Digital-analog quantum convolutional neural networks for image classification Anton Simen et.al. 2405.00548v1 null
2024-05-01 UWAFA-GAN: Ultra-Wide-Angle Fluorescein Angiography Transformation via Multi-scale Generation and Registration Enhancement Ruiquan Ge et.al. 2405.00542v1 link
2024-04-30 A Framework for Leveraging Human Computation Gaming to Enhance Knowledge Graphs for Accuracy Critical Generative AI Applications Steph Buongiorno et.al. 2404.19729v1 null
2024-04-30 Classification of simple 0-dimensional isolated complete intersection singularities Thuy Huong Pham et.al. 2404.19728v1 null
2024-04-30 PACER+: On-Demand Pedestrian Animation Controller in Driving Scenarios Jingbo Wang et.al. 2404.19722v1 null
2024-04-30 PANGeA: Procedural Artificial Narrative using Generative AI for Turn-Based Video Games Steph Buongiorno et.al. 2404.19721v1 null
2024-04-30 ThangDLU at #SMM4H 2024: Encoder-decoder models for classifying text data on social disorders in children and adolescents Hoang-Thang Ta et.al. 2404.19714v1 null
2024-04-30 A rank decomposition for the topological classification of neural representations Kosio Beshkov et.al. 2404.19710v1 null
2024-04-30 Neural Controlled Differential Equations with Quantum Hidden Evolutions Lingyi Yang et.al. 2404.19673v1 link
2024-04-30 Beyond MOS: Subjective Image Quality Score Preprocessing Method Based on Perceptual Similarity Lei Wang et.al. 2404.19666v1 null
2024-04-30 Towards Generalist Robot Learning from Internet Video: A Survey Robert McCarthy et.al. 2404.19664v1 null
2024-04-30 Regularization of Riemannian optimization: Application to process tomography and quantum machine learning Felix Soest et.al. 2404.19659v1 null
2024-04-29 Hallucination of Multimodal Large Language Models: A Survey Zechen Bai et.al. 2404.18930v1 link
2024-04-29 Swin2-MoSE: A New Single Image Super-Resolution Model for Remote Sensing Leonardo Rossi et.al. 2404.18924v1 null
2024-04-29 Anomaly and invertible field theory with higher-form symmetry: Extended group cohomology Shi Chen et.al. 2404.18921v1 null
2024-04-29 A Survey on Diffusion Models for Time Series and Spatio-Temporal Data Yiyuan Yang et.al. 2404.18886v1 link
2024-04-29 A Multilevel Strategy to Improve People Tracking in a Real-World Scenario Cristiano B. de Oliveira et.al. 2404.18876v1 null
2024-04-29 A Survey on Vision Mamba: Models, Applications and Challenges Rui Xu et.al. 2404.18861v1 link
2024-04-29 ConPro: Learning Severity Representation for Medical Images using Contrastive Learning and Preference Optimization Hong Nguyen et.al. 2404.18831v1 link
2024-04-29 Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior Zhiyuan Li et.al. 2404.18820v1 null
2024-04-29 Certification of Speaker Recognition Models to Additive Perturbations Dmitrii Korzh et.al. 2404.18791v1 null
2024-04-29 Understanding Radicals via Orbital Parities Reza G. Shirazi et.al. 2404.18787v1 null
2024-04-26 Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos Zhengze Xu et.al. 2404.17571v1 null
2024-04-26 Multifold topological semimetals Iñigo Robredo et.al. 2404.17539v1 null
2024-04-26 Exploring the Distinctiveness and Fidelity of the Descriptions Generated by Large Vision-Language Models Yuhang Huang et.al. 2404.17534v1 null
2024-04-26 Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations Puhao Li et.al. 2404.17521v1 link
2024-04-26 Learning text-to-video retrieval from image captioning Lucas Ventura et.al. 2404.17498v1 null
2024-04-26 Tabular Data Contrastive Learning via Class-Conditioned and Feature-Correlation Based Augmentation Wei Cui et.al. 2404.17489v1 link
2024-04-26 Low Cost Machine Vision for Insect Classification Danja Brandt et.al. 2404.17488v1 null
2024-04-26 Conformal Prediction with Learned Features Shayan Kiyani et.al. 2404.17487v1 null
2024-04-26 Sparse Reconstruction of Optical Doppler Tomography Based on State Space Model Zhenghong Li et.al. 2404.17484v1 null
2024-04-26 One-Shot Image Restoration Deborah Pereg et.al. 2404.17426v1 null
2024-04-25 Made to Order: Discovering monotonic temporal changes via self-supervised video ordering Charig Yang et.al. 2404.16828v1 null
2024-04-25 ResVR: Joint Rescaling and Viewport Rendering of Omnidirectional Images Weiqi Li et.al. 2404.16825v1 null
2024-04-25 V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection Xuanyu Zhang et.al. 2404.16824v1 null
2024-04-25 Learning Visuotactile Skills with Two Multifingered Hands Toru Lin et.al. 2404.16823v1 link
2024-04-25 Meta-Transfer Derm-Diagnosis: Exploring Few-Shot Learning and Transfer Learning for Skin Disease Classification in Long-Tail Distribution Zeynep Özdemir et.al. 2404.16814v1 null
2024-04-25 Transformer-Based Local Feature Matching for Multimodal Image Registration Remi Delaunay et.al. 2404.16802v1 null
2024-04-25 DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks Tongzhou Mu et.al. 2404.16779v1 null
2024-04-25 Modeling Selective Feature Attention for Representation-based Siamese Text Matching Jianxiang Zang et.al. 2404.16776v1 link
2024-04-25 Classifying One-Dimensional Quantum States Prepared by a Single Round of Measurements Rahul Sahay et.al. 2404.16753v1 null
2024-04-25 Characterizing Solar Center-to-Limb Radial-Velocity Variability with SDO Michael L. Palumbo III et.al. 2404.16747v1 null
2024-04-24 Optimizing OOD Detection in Molecular Graphs: A Novel Approach with Diffusion Models Xu Shen et.al. 2404.15625v1 null
2024-04-24 Layer Ensemble Averaging for Improving Memristor-Based Artificial Neural Network Performance Osama Yousuf et.al. 2404.15621v1 null
2024-04-24 A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution Zhixiong Yang et.al. 2404.15620v1 link
2024-04-24 MDDD: Manifold-based Domain Adaptation with Dynamic Distribution for Non-Deep Transfer Learning in Cross-subject and Cross-session EEG-based Emotion Recognition Ting Luo et.al. 2404.15615v1 null
2024-04-24 Federated Learning with Only Positive Labels by Exploring Label Correlations Xuming An et.al. 2404.15598v1 null
2024-04-24 A Survey of Deep Long-Tail Classification Advancements Charika de Alvis et.al. 2404.15593v1 null
2024-04-24 Domain Adaptation for Learned Image Compression with Supervised Adapters Alberto Presta et.al. 2404.15591v1 null
2024-04-24 Brain Storm Optimization Based Swarm Learning for Diabetic Retinopathy Image Classification Liang Qu et.al. 2404.15585v1 null
2024-04-24 Research on OPF control of three-phase four-wire low-voltage distribution network considering uncertainty Rui Wang et.al. 2404.15584v1 null
2024-04-24 MiM: Mask in Mask Self-Supervised Pre-Training for 3D Medical Image Analysis Jiaxin Zhuang et.al. 2404.15580v1 null
2024-04-23 ID-Animator: Zero-Shot Identity-Preserving Human Video Generation Xuanhua He et.al. 2404.15275v1 link
2024-04-23 Metric-guided Image Reconstruction Bounds via Conformal Prediction Matt Y Cheung et.al. 2404.15274v1 link
2024-04-23 Quantum optical classifier with superexponential speedup Simone Roncallo et.al. 2404.15266v1 null
2024-04-23 TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting Jiahe Li et.al. 2404.15264v1 null
2024-04-23 Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization Lahav Lipson et.al. 2404.15263v1 link
2024-04-23 FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent Cameron Smith et.al. 2404.15259v1 null
2024-04-23 Source-free Domain Adaptation for Video Object Detection Under Adverse Image Conditions Xingguang Zhang et.al. 2404.15252v1 null
2024-04-23 Unifying the Temperature Dependent Dynamics of Glasses Joseph B. Schlenoff et.al. 2404.15250v1 null
2024-04-23 Mining Invariance from Nonlinear Multi-Environment Data: Binary Classification Austin Goddard et.al. 2404.15245v1 null
2024-04-23 Revisiting Unnaturalness for Automated Program Repair in the Era of Large Language Models Aidan Z. H. Yang et.al. 2404.15236v1 null
2024-04-22 AutoAD III: The Prequel -- Back to the Pixels Tengda Han et.al. 2404.14412v1 null
2024-04-22 Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses Inhee Lee et.al. 2404.14410v1 null
2024-04-22 Hyp-OC: Hyperbolic One Class Classification for Face Anti-Spoofing Kartik Narayan et.al. 2404.14406v1 null
2024-04-22 A mean curvature flow arising in adversarial training Leon Bungert et.al. 2404.14402v1 null
2024-04-22 TAVGBench: Benchmarking Text to Audible-Video Generation Yuxin Mao et.al. 2404.14381v1 link
2024-04-22 Rethinking Legal Compliance Automation: Opportunities with Large Language Models Shabnam Hassani et.al. 2404.14356v1 null
2024-04-22 On-the-Fly Point Annotation for Fast Medical Video Labeling Meyer Adrien et.al. 2404.14344v1 null
2024-04-22 X-Ray: A Sequential 3D Representation for Generation Tao Hu et.al. 2404.14329v1 null
2024-04-22 A Novel Approach to Chest X-ray Lung Segmentation Using U-net and Modified Convolutional Block Attention Module Mohammad Ali Labbaf Khaniki et.al. 2404.14322v1 null
2024-04-22 "I Upload...All Types of Different Things to Say, the World of Blindness Is More Than What They Think It Is": A Study of Blind TikTokers' Identity Work from a Flourishing Perspective Yao Lyu et.al. 2404.14305v1 null
2024-04-19 Data Alignment for Zero-Shot Concept Generation in Dermatology AI Soham Gadgil et.al. 2404.13043v1 null
2024-04-19 PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation Tianyuan Zhang et.al. 2404.13026v1 null
2024-04-19 BANF: Band-limited Neural Fields for Levels of Detail Reconstruction Ahan Shabanov et.al. 2404.13024v1 null
2024-04-19 Stronger Random Baselines for In-Context Learning Gregory Yauney et.al. 2404.13020v1 link
2024-04-19 A New Multi-Picture Architecture for Learned Video Deinterlacing and Demosaicing with Parallel Deformable Convolution and Self-Attention Blocks Ronglei Ji et.al. 2404.13018v1 null
2024-04-19 Towards Robust Ferrous Scrap Material Classification with Deep Learning and Conformal Prediction Paulo Henrique dos Santos et.al. 2404.13002v1 null
2024-04-19 RadRotator: 3D Rotation of Radiographs with Diffusion Models Pouria Rouzrokh et.al. 2404.13000v1 null
2024-04-19 Nuclei Instance Segmentation of Cryosectioned H&E Stained Histological Images using Triple U-Net Architecture Zarif Ahmed et.al. 2404.12986v1 null
2024-04-19 Cross-modal Diffusion Modelling for Super-resolved Spatial Transcriptomics Xiaofei Wang et.al. 2404.12973v1 null
2024-04-19 Improving Pediatric Pneumonia Diagnosis with Adult Chest X-ray Images Utilizing Contrastive Learning and Embedding Similarity Mohammad Zunaed et.al. 2404.12958v1 null
2024-04-18 On the Content Bias in Fréchet Video Distance Songwei Ge et.al. 2404.12391v1 null
2024-04-18 Moving Object Segmentation: All You Need Is SAM (and Flow) Junyu Xie et.al. 2404.12389v1 null
2024-04-18 VideoGigaGAN: Towards Detail-rich Video Super-Resolution Yiran Xu et.al. 2404.12388v1 null
2024-04-18 Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models Aitor Ormazabal et.al. 2404.12387v1 null
2024-04-18 G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis Yufei Ye et.al. 2404.12383v1 null
2024-04-18 Dynamic Gaussians Mesh: Consistent Mesh Reconstruction from Monocular Videos Isabella Liu et.al. 2404.12379v1 null
2024-04-18 RoboDreamer: Learning Compositional World Models for Robot Imagination Siyuan Zhou et.al. 2404.12377v1 null
2024-04-18 When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes Asaf Yehudai et.al. 2404.12365v1 null
2024-04-18 Inverse Neural Rendering for Explainable Multi-Object Tracking Julian Ost et.al. 2404.12359v1 null
2024-04-18 Improving the interpretability of GNN predictions through conformal-based graph sparsification Pablo Sanchez-Martin et.al. 2404.12356v1 link
2024-04-18 Dynamic Typography: Bringing Text to Life via Video Diffusion Prior Zichen Liu et.al. 2404.11614v2 null
2024-04-17 VG4D: Vision-Language Model Goes 4D Video Recognition Zhichao Deng et.al. 2404.11605v1 link
2024-04-17 Variational Bayesian Last Layers James Harrison et.al. 2404.11599v1 link
2024-04-17 State-space Decomposition Model for Video Prediction Considering Long-term Motion Trend Fei Cui et.al. 2404.11576v1 null
2024-04-17 Simple Image Signal Processing using Global Context Guidance Omar Elezabi et.al. 2404.11569v1 link
2024-04-17 Spatio-Temporal Motion Retargeting for Quadruped Robots Taerim Yoon et.al. 2404.11557v1 null
2024-04-17 Predicting Long-horizon Futures by Conditioning on Geometry and Time Tarasha Khurana et.al. 2404.11554v1 null
2024-04-17 Carbon- and Oxygen-rich stars in MaStar: identification and classification Lewis Hill et.al. 2404.11541v1 null
2024-04-17 GenFighter: A Generative and Evolutive Textual Attack Removal Md Athikul Islam et.al. 2404.11538v1 null
2024-04-17 SSDiff: Spatial-spectral Integrated Diffusion Model for Remote Sensing Pansharpening Yu Zhong et.al. 2404.11537v1 null
2024-04-16 COMBO: Compositional World Models for Embodied Multi-Agent Cooperation Hongxin Zhang et.al. 2404.10775v1 null
2024-04-16 RapidVol: Rapid Reconstruction of 3D Ultrasound Volumes from Sensorless 2D Scans Mark C. Eid et.al. 2404.10766v1 null
2024-04-16 Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification Yu-Yang Li et.al. 2404.10757v1 null
2024-04-16 Integer-valued o-minimal functions Neer Bhardwaj et.al. 2404.10737v1 null
2024-04-16 Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning Hao-Lun Hsu et.al. 2404.10728v1 null
2024-04-16 AV-GAN: Attention-Based Varifocal Generative Adversarial Network for Uneven Medical Image Translation Zexin Li et.al. 2404.10714v1 null
2024-04-17 Dual Modalities of Text: Visual and Textual Generative Pre-training Yekun Chai et.al. 2404.10710v2 null
2024-04-16 Question Difficulty Ranking for Multiple-Choice Reading Comprehension Vatsal Raina et.al. 2404.10704v1 null
2024-04-16 Retrieval Augmented Verification : Unveiling Disinformation with Structured Representations for Zero-Shot Real-Time Evidence-guided Fact-Checking of Multi-modal Social media posts Arka Ujjal Dey et.al. 2404.10702v1 null
2024-04-16 Rawformer: Unpaired Raw-to-Raw Translation for Learnable Camera ISPs Georgy Perevozchikov et.al. 2404.10700v1 null
2024-04-15 Squish Jamming Samuel Poincloux et.al. 2404.09773v1 null
2024-04-15 Hilti SLAM Challenge 2023: Benchmarking Single + Multi-session SLAM across Sensor Constellations in Construction Ashish Devadas Nair et.al. 2404.09765v1 null
2024-04-15 Deep Learning-Based Segmentation of Tumors in PET/CT Volumes: Benchmark of Different Architectures and Training Strategies Monika Górka et.al. 2404.09761v1 null
2024-04-15 Quantization of Large Language Models with an Overdetermined Basis Daniil Merkulov et.al. 2404.09737v1 null
2024-04-15 FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features Andre Rochow et.al. 2404.09736v1 null
2024-04-15 Classification of finite type fusion quivers Ben Elias et.al. 2404.09714v1 null
2024-04-15 LoRAP: Transformer Sub-Layers Deserve Differentiated Structured Compression for Large Language Models Guangyan Li et.al. 2404.09695v1 null
2024-04-15 Harnessing GPT-4V(ision) for Insurance: A Preliminary Exploration Chenwei Lin et.al. 2404.09690v1 null
2024-04-15 Post-Training Network Compression for 3D Medical Image Segmentation: Reducing Computational Efforts via Tucker Decomposition Tobias Weber et.al. 2404.09683v1 link
2024-04-15 Cluster analysis of the Roma-BZCAT blazars D. O. Kudryavtsev et.al. 2404.09667v1 null
2024-04-15 Deformable MRI Sequence Registration for AI-based Prostate Cancer Diagnosis Alessa Hering et.al. 2404.09666v1 null
2024-04-15 Closing the Gap in the Trade-off between Fair Representations and Accuracy Biswajit Rout et.al. 2404.09664v1 null
2024-04-15 If there's a Trigger Warning, then where's the Trigger? Investigating Trigger Warnings at the Passage Level Matti Wiegmann et.al. 2404.09615v1 link
2024-04-12 FCert: Certifiably Robust Few-Shot Classification in the Era of Foundation Models Yanting Wang et.al. 2404.08631v1 null
2024-04-12 Classification of Boolean Algebras through von Neumann regular $\mathcal{C}^{\infty}-$Rings Jean Cerqueira Berni et.al. 2404.08629v1 null
2024-04-12 Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation Yanhao Zheng et.al. 2404.08603v1 link
2024-04-12 Pathological Primitive Segmentation Based on Visual Foundation Model with Zero-Shot Mask Generation Abu Bakor Hayat Arnob et.al. 2404.08584v1 link
2024-04-12 Lossy Image Compression with Foundation Diffusion Models Lucas Relic et.al. 2404.08580v1 null
2024-04-12 IDD-X: A Multi-View Dataset for Ego-relative Important Object Localization and Explanation in Dense and Unstructured Traffic Chirag Parikh et.al. 2404.08561v1 null
2024-04-12 Scalability in Building Component Data Annotation: Enhancing Facade Material Classification with Synthetic Data Josie Harrison et.al. 2404.08557v1 null
2024-04-12 Benchmarking the Cell Image Segmentation Models Robustness under the Microscope Optical Aberrations Boyuan Peng et.al. 2404.08549v1 null
2024-04-12 VertAttack: Taking advantage of Text Classifiers' horizontal vision Jonathan Rusert et.al. 2404.08538v1 null
2024-04-12 Text Prompt with Normality Guidance for Weakly Supervised Video Anomaly Detection Zhiwei Yang et.al. 2404.08531v1 null
2024-04-11 Connecting NeRFs, Images, and Text Francesco Ballerini et.al. 2404.07993v1 null
2024-04-11 GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh Jing Wen et.al. 2404.07991v1 null
2024-04-11 WaveMo: Learning Wavefront Modulations to See Through Scattering Mingyang Xie et.al. 2404.07985v1 null
2024-04-11 Gaga: Group Any Gaussians via 3D-aware Memory Bank Weijie Lyu et.al. 2404.07977v1 null
2024-04-11 FusionMamba: Efficient Image Fusion with State Space Model Siran Peng et.al. 2404.07932v1 null
2024-04-11 HGRN2: Gated Linear RNNs with State Expansion Zhen Qin et.al. 2404.07904v1 link
2024-04-11 Q-ITAGS: Quality-Optimized Spatio-Temporal Heterogeneous Task Allocation with a Time Budget Glen Neville et.al. 2404.07902v1 null
2024-04-11 Auditing health-related recommendations in social media: A Case Study of Abortion on YouTube Mohammed Lahsaini et.al. 2404.07896v1 null
2024-04-11 Typical blocks of the category $\mathcal O$ and Whittaker modules for Takiff superalgebras Chih-Whi Chen et.al. 2404.07894v1 null
2024-04-11 Context-aware Video Anomaly Detection in Long-Term Datasets Zhengye Yang et.al. 2404.07887v1 null
2024-04-10 RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion Jaidev Shriram et.al. 2404.07199v1 null
2024-04-10 GCV-Turbo: End-to-end Acceleration of GNN-based Computer Vision Tasks on FPGA Bingyi Zhang et.al. 2404.07188v1 null
2024-04-10 Adinkras and Pure Spinors Richard Eager et.al. 2404.07167v1 null
2024-04-10 Lost in Translation: Modern Neural Networks Still Struggle With Small Realistic Image Transformations Ofir Shifman et.al. 2404.07153v1 null
2024-04-10 Learning of deep convolutional network image classifiers via stochastic gradient descent and over-parametrization Michael Kohler et.al. 2404.07128v1 null
2024-04-10 Measuring proximity to standard planes during fetal brain ultrasound scanning Chiara Di Vece et.al. 2404.07124v1 null
2024-04-10 "My toxic trait is thinking I'll remember this": gaps in the learner experience of video tutorials for feature-rich software Ian Drosos et.al. 2404.07114v1 null
2024-04-10 The generic dual of p-adic groups and applications Chris Jantzen et.al. 2404.07111v1 null
2024-04-10 Learning Priors for Non Rigid SfM from Casual Videos Yoni Kasten et.al. 2404.07097v1 null
2024-04-10 VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning Alexandros Xenos et.al. 2404.07078v1 link
2024-04-09 MoReVQA: Exploring Modular Reasoning Models for Video Question Answering Juhong Min et.al. 2404.06511v1 null
2024-04-10 Reconstructing Hand-Held Objects in 3D Jane Wu et.al. 2404.06507v2 null
2024-04-09 A Machine Learning Framework for the Prediction of Grain Boundary Segregation in Chemically Complex Environments Doruk Aksoy et.al. 2404.06499v1 null
2024-04-10 Flying with Photons: Rendering Novel Views of Propagating Light Anagh Malik et.al. 2404.06493v2 null
2024-04-09 Uncovering Tidal Treasures: Automated Classification of Faint Tidal Features in DECaLS Data Alexander J. Gordon et.al. 2404.06487v1 null
2024-04-09 RhythmMamba: Fast Remote Physiological Measurement with Arbitrary Length Videos Bochao Zou et.al. 2404.06483v1 null
2024-04-09 Laue Indexing with Optimal Transport Tomasz Kacprzak et.al. 2404.06478v1 link
2024-04-09 A comparative analysis of deep learning models for lung segmentation on X-ray images Weronika Hryniewska-Guzik et.al. 2404.06455v1 link
2024-04-09 QueSTMaps: Queryable Semantic Topological Maps for 3D Scene Understanding Yash Mehan et.al. 2404.06442v1 null
2024-04-09 ClassiPyGRB: Machine Learning-Based Classification and Visualization of Gamma Ray Bursts using t-SNE Keneth Garcia-Cifuentes et.al. 2404.06439v1 null
2024-04-08 MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding Bo He et.al. 2404.05726v1 null
2024-04-08 Predicting Overtakes in Trucks Using CAN Data Talha Hanif Butt et.al. 2404.05723v1 null
2024-04-08 Case Study: Neural Network Malware Detection Verification for Feature and Image Datasets Preston K. Robinette et.al. 2404.05703v1 null
2024-04-08 Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding Ahmad Idrissi-Yaghir et.al. 2404.05694v1 null
2024-04-08 Evaluating the Efficacy of Cut-and-Paste Data Augmentation in Semantic Segmentation for Satellite Imagery Ionut M. Motoi et.al. 2404.05693v1 null
2024-04-08 AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation Jiannan Ge et.al. 2404.05667v1 null
2024-04-08 Oblique photons, plasmons, and current-plasmons in relativistic plasmas and their topological implications Hong Qin et.al. 2404.05636v1 null
2024-04-08 AnchorAL: Computationally Efficient Active Learning for Large and Imbalanced Datasets Pietro Lesci et.al. 2404.05623v1 null
2024-04-08 Experimental observation of a time rondeau crystal: Temporal Disorder in Spatiotemporal Order Leo Joon Il Moon et.al. 2404.05620v1 null
2024-04-08 Self-Explainable Affordance Learning with Embodied Caption Zhipeng Zhang et.al. 2404.05603v1 null
2024-04-05 On classification of global dynamics for energy-critical equivariant harmonic map heat flows and radial nonlinear heat equation Kihyun Kim et.al. 2404.04247v1 null
2024-04-05 Evaluating Adversarial Robustness: A Comparison Of FGSM, Carlini-Wagner Attacks, And The Role of Distillation as Defense Mechanism Trilokesh Ranjan Sarkar et.al. 2404.04245v1 null
2024-04-05 player2vec: A Language Modeling Approach to Understand Player Behavior in Games Tianze Wang et.al. 2404.04234v1 null
2024-04-05 Deep-learning Segmentation of Small Volumes in CT images for Radiotherapy Treatment Planning Jianxin Zhou et.al. 2404.04202v1 null
2024-04-05 SCAResNet: A ResNet Variant Optimized for Tiny Object Detection in Transmission and Distribution Towers Weile Li et.al. 2404.04179v1 link
2024-04-05 Noisy Label Processing for Classification: A Survey Mengting Li et.al. 2404.04159v1 null
2024-04-05 Improving Detection in Aerial Images by Capturing Inter-Object Relationships Botao Ren et.al. 2404.04140v1 null
2024-04-05 Label Propagation for Zero-shot Classification with Vision-Language Models Vladan Stojnić et.al. 2404.04072v1 link
2024-04-05 VoicePilot: Harnessing LLMs as Speech Interfaces for Physically Assistive Robots Akhil Padmanabha et.al. 2404.04066v1 null
2024-04-05 Phase Binarization in Mutually Synchronized Bias Field-free Spin Hall Nano-oscillators for Reservoir Computing Sourabh Manna et.al. 2404.04023v1 null
2024-04-04 OW-VISCap: Open-World Video Instance Segmentation and Captioning Anwesa Choudhuri et.al. 2404.03657v1 null
2024-04-04 Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation Shuting He et.al. 2404.03645v1 link
2024-04-04 On the Efficiency of Convolutional Neural Networks Andrew Lavin et.al. 2404.03617v1 null
2024-04-04 Creator Hearts: Investigating the Impact Positive Signals from YouTube Creators in Shaping Comment Section Behavior Frederick Choi et.al. 2404.03612v1 null
2024-04-04 InsectMamba: Insect Pest Classification with State Space Model Qianning Wang et.al. 2404.03611v1 null
2024-04-04 DiffDet4SAR: Diffusion-based Aircraft Target Detection Network for SAR Images Zhou Jie et.al. 2404.03595v1 link
2024-04-04 Alzheimer's disease detection in PSG signals Lorena Gallego-Viñarás et.al. 2404.03549v1 null
2024-04-04 Towards Transcranial 3D Ultrasound Localization Microscopy of the Nonhuman Primate Brain Paul Xing et.al. 2404.03547v1 null
2024-04-04 Segmentation-Guided Knee Radiograph Generation using Conditional Diffusion Models Siyuan Mei et.al. 2404.03541v1 null
2024-04-05 A Methodology to Study the Impact of Spiking Neural Network Parameters considering Event-Based Automotive Data Iqra Bano et.al. 2404.03493v2 null
2024-04-03 LidarDM: Generative LiDAR Simulation in a Generated World Vlas Zyrianov et.al. 2404.02903v1 null
2024-04-03 Guarantees of confidentiality via Hammersley-Chapman-Robbins bounds Kamalika Chaudhuri et.al. 2404.02866v1 link
2024-04-03 Semisimple Algebras of Vector Fields on $\mathbb{C}^{3}$ Sajid Ali et.al. 2404.02847v1 null
2024-04-03 GPU-Accelerated RSF Level Set Evolution for Large-Scale Microvascular Segmentation Meher Niger et.al. 2404.02813v1 null
2024-04-03 Generative-Contrastive Heterogeneous Graph Neural Network Yu Wang et.al. 2404.02810v1 null
2024-04-03 FPT: Feature Prompt Tuning for Few-shot Readability Assessment Ziyang Wang et.al. 2404.02772v1 link
2024-04-03 DIBS: Enhancing Dense Video Captioning with Unlabeled Videos via Pseudo Boundary Enrichment and Online Refinement Hao Wu et.al. 2404.02755v1 null
2024-04-03 Terraced Compression Method with Automated Threshold Selection for Multidimensional Image Clustering of Heterogeneous Bodies Jiatong Li et.al. 2404.02744v1 null
2024-04-03 Event Camera Demosaicing via Swin Transformer and Pixel-focus Loss Yunfan Lu et.al. 2404.02731v1 link
2024-04-03 Unblind Text Inputs: Predicting Hint-text of Text Input in Mobile Apps via LLM Zhe Liu et.al. 2404.02706v1 null
2024-04-02 Diffusion$^2$: Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models Zeyu Yang et.al. 2404.02148v1 link
2024-04-02 Multiparametric quantification and visualization of liver fat using ultrasound Jihye Baek et.al. 2404.02143v1 null
2024-04-03 ResNet with Integrated Convolutional Block Attention Module for Ship Classification Using Transfer Learning on Optical Satellite Imagery Ryan Donghan Kwon et.al. 2404.02135v2 null
2024-04-02 ViTamin: Designing Scalable Vision Models in the Vision-Language Era Jienneg Chen et.al. 2404.02132v1 link
2024-04-02 ImageNot: A contrast with ImageNet preserves model rankings Olawale Salaudeen et.al. 2404.02112v1 null
2024-04-02 CameraCtrl: Enabling Camera Control for Text-to-Video Generation Hao He et.al. 2404.02101v1 link
2024-04-02 Explainability in JupyterLab and Beyond: Interactive XAI Systems for Integrated and Collaborative Workflows Grace Guo et.al. 2404.02081v1 null
2024-04-02 Multi-Level Label Correction by Distilling Proximate Patterns for Semi-supervised Semantic Segmentation Hui Xiao et.al. 2404.02065v1 null
2024-04-02 Long-context LLMs Struggle with Long In-context Learning Tianle Li et.al. 2404.02060v1 link
2024-04-02 Deconstructing In-Context Learning: Understanding Prompts via Corruption Namrata Shivagunde et.al. 2404.02054v1 link
2024-03-29 Learn "No" to Say "Yes" Better: Improving Vision-Language Models via Negations Jaisidh Singh et.al. 2403.20312v1 link
2024-03-29 Emotion-Anchored Contrastive Learning Framework for Emotion Recognition in Conversation Fangxu Yu et.al. 2403.20289v1 link
2024-03-29 Prototype-based Interpretable Breast Cancer Prediction Models: Analysis and Challenges Shreyasi Pathak et.al. 2403.20260v1 null
2024-03-29 Benchmarking the Robustness of Temporal Action Detection Models Against Temporal Corruptions Runhao Zeng et.al. 2403.20254v1 null
2024-03-29 Latent Embedding Clustering for Occlusion Robust Head Pose Estimation José Celestino et.al. 2403.20251v1 null
2024-03-29 Long-Tailed Anomaly Detection with Learnable Class Names Chih-Hui Ho et.al. 2403.20236v1 null
2024-04-02 Artificial Neural Networks-based Real-time Classification of ENG Signals for Implanted Nerve Interfaces Antonio Coviello et.al. 2403.20234v2 null
2024-03-29 MTMMC: A Large-Scale Real-World Multi-Modal Camera Tracking Benchmark Sanghyun Woo et.al. 2403.20225v1 null
2024-03-29 Unleashing the Potential of Large Language Models for Predictive Tabular Tasks in Data Science Yazheng Yang et.al. 2403.20208v1 null
2024-03-29 The Future of Combating Rumors? Retrieval, Discrimination, and Generation Junhao Xu et.al. 2403.20204v1 null
2024-03-28 RSMamba: Remote Sensing Image Classification with State Space Model Keyan Chen et.al. 2403.19654v1 link
2024-03-28 Square patterns in dynamical orbits Vefa Goksel et.al. 2403.19642v1 null
2024-03-28 Siamese Vision Transformers are Scalable Audio-visual Learners Yan-Bo Lin et.al. 2403.19638v1 null
2024-03-28 Four-dimensional gradient Ricci solitons with (half) nonnegative isotropic curvature Huai-Dong Cao et.al. 2403.19627v1 null
2024-03-28 Top-$k$ Classification and Cardinality-Aware Prediction Anqi Mao et.al. 2403.19625v1 null
2024-03-28 RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents Zeren Chen et.al. 2403.19622v1 null
2024-03-28 SAID-NeRF: Segmentation-AIDed NeRF for Depth Completion of Transparent Objects Avinash Ummadisingu et.al. 2403.19607v1 null
2024-03-28 Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model Zhicai Wang et.al. 2403.19600v1 link
2024-03-28 Frame by Familiar Frame: Understanding Replication in Video Diffusion Models Aimon Rahman et.al. 2403.19593v1 null
2024-03-28 Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation Zhongliang Zhou et.al. 2403.19584v1 null
2024-03-27 MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering Guoxing Sun et.al. 2403.18820v1 null
2024-03-27 Breaking the Limitations with Sparse Inputs by Variational Frameworks (BLIss) in Terahertz Super-Resolution 3D Reconstruction Yiyao Zhang et.al. 2403.18776v1 null
2024-03-27 CaT: Constraints as Terminations for Legged Locomotion Reinforcement Learning Elliot Chane-Sane et.al. 2403.18765v1 null
2024-03-27 A vascular synthetic model for improved aneurysm segmentation and detection via Deep Neural Networks Rafic Nader et.al. 2403.18734v1 null
2024-03-27 Contrastive Learning with Orthonormal Anchors (CLOA) Huanran Li et.al. 2403.18699v1 null
2024-03-27 Annolid: Annotate, Segment, and Track Anything You Need Chen Yang et.al. 2403.18690v1 null
2024-03-27 InceptionTime vs. Wavelet -- A comparison for time series classification Daniel Klenkert et.al. 2403.18687v1 null
2024-03-27 TransFusion: Contrastive Learning with Transformers Huanran Li et.al. 2403.18681v1 null
2024-03-28 FluxGAT: Integrating Flux Sampling with Graph Neural Networks for Unbiased Gene Essentiality Classification Kieren Sharma et.al. 2403.18666v2 null
2024-03-27 Indecomposable set-theoretical solutions to the Yang-Baxter equation of size $p^2$ Carsten Dietzel et.al. 2403.18653v1 null
2024-03-26 Efficient Video Object Segmentation via Modulated Cross-Attention Memory Abdelrahman Shaker et.al. 2403.17937v1 link
2024-03-26 ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis Muhammad Hamza Mughal et.al. 2403.17936v1 null
2024-03-26 OmniVid: A Generative Framework for Universal Video Understanding Junke Wang et.al. 2403.17935v1 link
2024-03-26 Track Everything Everywhere Fast and Robustly Yunzhou Song et.al. 2403.17931v1 null
2024-03-26 FastCAR: Fast Classification And Regression Multi-Task Learning via Task Consolidation for Modelling a Continuous Property Variable of Object Classes Anoop Kini et.al. 2403.17926v1 null
2024-03-26 The Need for Speed: Pruning Transformers with One Recipe Samir Khaki et.al. 2403.17921v1 link
2024-03-26 TC4D: Trajectory-Conditioned Text-to-4D Generation Sherwin Bahmani et.al. 2403.17920v1 null
2024-03-26 AgentStudio: A Toolkit for Building General Virtual Agents Longtao Zheng et.al. 2403.17918v1 null
2024-03-26 Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos Akshay Paruchuri et.al. 2403.17915v1 null
2024-03-26 Hierarchical Multi-label Classification for Fine-level Event Extraction from Aviation Accident Reports Xinyu Zhao et.al. 2403.17914v1 null
2024-03-25 DBPF: A Framework for Efficient and Robust Dynamic Bin-Picking Yichuan Li et.al. 2403.16786v1 null
2024-03-25 C-arm inverse geometry CT for 3D cardiac chamber mapping Jordan M. Slagowski et.al. 2403.16779v1 null
2024-03-25 Diff-Def: Diffusion-Generated Deformation Fields for Conditional Atlases Sophie Starck et.al. 2403.16776v1 null
2024-03-25 As Good As A Coin Toss Human detection of AI-generated images, videos, audio, and audiovisual stimuli Di Cooke et.al. 2403.16760v1 null
2024-03-25 Creating a Digital Twin of Spinal Surgery: A Proof of Concept Jonas Hein et.al. 2403.16736v1 null
2024-03-25 A Robotic Skill Learning System Built Upon Diffusion Policies and Foundation Models Nils Ingelhag et.al. 2403.16730v1 null
2024-03-25 One-Shot Domain Incremental Learning Yasushi Esaki et.al. 2403.16707v1 null
2024-03-25 Assessing the Performance of Deep Learning for Automated Gleason Grading in Prostate Cancer Dominik Müller et.al. 2403.16695v1 null
2024-03-25 DeepGleason: a System for Automated Gleason Grading of Prostate Cancer using Deep Neural Networks Dominik Müller et.al. 2403.16678v1 link
2024-03-25 FOOL: Addressing the Downlink Bottleneck in Satellite Computing with Neural Feature Compression Alireza Furutanpey et.al. 2403.16677v1 null
2024-03-25 A Novel Loss Function-based Support Vector Machine for Binary Classification Yan Li et.al. 2403.16654v1 null
2024-03-25 Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution Qingping Zheng et.al. 2403.16643v1 null
2024-03-25 Multi-Scale Texture Loss for CT denoising with GANs Francesco Di Feola et.al. 2403.16640v1 link
2024-03-25 AI-Generated Video Detection via Spatio-Temporal Anomaly Learning Jianfa Bai et.al. 2403.16638v1 null
2024-03-25 Distributed collaborative anomalous sound detection by embedding sharing Kota Dohi et.al. 2403.16610v1 null
2024-03-25 EDUE: Expert Disagreement-Guided One-Pass Uncertainty Estimation for Medical Image Segmentation Kudaibergen Abutalip et.al. 2403.16594v1 null
2024-03-22 LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models Yuzhang Shang et.al. 2403.15388v1 null
2024-03-22 Time-efficient, high-resolution 3T whole-brain relaxometry using Cartesian 3D MR-STAT with CSF suppression Hongyan Liu et.al. 2403.15379v1 null
2024-03-22 Long-CLIP: Unlocking the Long-Text Capability of CLIP Beichen Zhang et.al. 2403.15378v1 null
2024-03-22 InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding Yi Wang et.al. 2403.15377v1 null
2024-03-22 Cascading Blackout Severity Prediction with Statistically-Augmented Graph Neural Networks Joe Gorka et.al. 2403.15363v1 null
2024-03-22 SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series Badri N. Patro et.al. 2403.15360v1 null
2024-03-22 Ultrasound Imaging based on the Variance of a Diffusion Restoration Model Yuxin Zhang et.al. 2403.15316v1 null
2024-03-22 Global Control for Local SO(3)-Equivariant Scale-Invariant Vessel Segmentation Patryk Rygiel et.al. 2403.15314v1 null
2024-03-22 Quantum-inspired classification via efficient simulation of Helstrom measurement Wooseop Hwang et.al. 2403.15308v1 null
2024-03-22 Reconnaissance ultracool spectra in the Euclid Deep Fields Jerry Jun-Yan Zhang et.al. 2403.15288v1 null
2024-03-21 Language Repository for Long Video Understanding Kumara Kahatapitiya et.al. 2403.14622v1 link
2024-03-22 Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion Xiang Fan et.al. 2403.14617v2 null
2024-03-21 Explorative Inbetweening of Time and Space Haiwen Feng et.al. 2403.14611v1 null
2024-03-21 ReNoise: Real Image Inversion Through Iterative Noising Daniel Garibi et.al. 2403.14602v1 null
2024-03-21 PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model Zheng Zhang et.al. 2403.14598v1 link
2024-03-21 Large Language Models for Multi-Choice Question Classification of Medical Subjects Víctor Ponce-López et.al. 2403.14582v1 null
2024-03-21 DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video Narek Tumanyan et.al. 2403.14548v1 null
2024-03-21 Estimating Physical Information Consistency of Channel Data Augmentation for Remote Sensing Images Tom Burgert et.al. 2403.14547v1 null
2024-03-21 Transfer Learning for Cross-dataset Isolated Sign Language Recognition in Under-Resourced Datasets Ahmet Alp Kindiroglu et.al. 2403.14534v1 link
2024-03-21 Invisible Needle Detection in Ultrasound: Leveraging Mechanism-Induced Vibration Chenyang Li et.al. 2403.14523v1 null
2024-03-21 Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting Alicia Durrer et.al. 2403.14499v1 link
2024-03-20 TimeRewind: Rewinding Time with Image-and-Events Video Diffusion Jingxi Chen et.al. 2403.13800v1 null
2024-03-20 Hierarchical NeuroSymbolic Approach for Action Quality Assessment Lauren Okamoto et.al. 2403.13798v1 null
2024-03-20 Bridge the Modality and Capacity Gaps in Vision-Language Model Selection Chao Yi et.al. 2403.13797v1 null
2024-03-20 The Model Openness Framework: Promoting Completeness and Openness for Reproducibility, Transparency and Usability in AI Matt White et.al. 2403.13784v1 null
2024-03-20 Gradings on associative triple systems of the second kind Alberto Daza-Garcia et.al. 2403.13775v1 null
2024-03-20 Towards Principled Representation Learning from Videos for Reinforcement Learning Dipendra Misra et.al. 2403.13765v1 null
2024-03-20 Enhancing Gait Video Analysis in Neurodegenerative Diseases by Knowledge Augmentation in Vision Language Model Diwei Wang et.al. 2403.13756v1 null
2024-03-20 Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation Fu-Yun Wang et.al. 2403.13745v1 null
2024-03-20 Probabilistic Forecasting with Stochastic Interpolants and Föllmer Processes Yifan Chen et.al. 2403.13724v1 null
2024-03-20 Improving the Adaptive Moment Estimation (ADAM) stochastic optimizer through an Implicit-Explicit (IMEX) time-stepping approach Abhinab Bhattacharjee et.al. 2403.13704v1 null
2024-03-19 LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression Zhuoshi Pan et.al. 2403.12968v1 null
2024-03-19 FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation Shuai Yang et.al. 2403.12962v1 link
2024-03-19 WHAC: World-grounded Humans and Cameras Wanqi Yin et.al. 2403.12959v1 null
2024-03-19 FutureDepth: Learning to Predict the Future Improves Video Depth Estimation Rajeev Yasarla et.al. 2403.12953v1 null
2024-03-19 Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models Elaine Sui et.al. 2403.12952v1 link
2024-03-19 Legendrian loops and cluster modular groups James Hughes et.al. 2403.12951v1 null
2024-03-19 Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers Vidhi Jain et.al. 2403.12943v1 null
2024-03-19 Contextual AD Narration with Interleaved Multimodal Sequence Hanlin Wang et.al. 2403.12922v1 null
2024-03-19 Semantic Layering in Room Segmentation via LLMs Taehyeon Kim et.al. 2403.12920v1 null
2024-03-19 Yell At Your Robot: Improving On-the-Fly from Language Corrections Lucy Xiaoyang Shi et.al. 2403.12910v1 null
2024-03-18 Time Series Compression using Quaternion Valued Neural Networks and Quaternion Backpropagation Johannes Pöppelbaum et.al. 2403.11722v1 null
2024-03-18 Virbo: Multimodal Multilingual Avatar Video Generation in Digital Marketing Juan Zhang et.al. 2403.11700v1 null
2024-03-18 A Spatial-Temporal Progressive Fusion Network for Breast Lesion Segmentation in Ultrasound Videos Zhengzheng Tu et.al. 2403.11699v1 null
2024-03-18 Object Segmentation-Assisted Inter Prediction for Versatile Video Coding Zhuoyuan Li et.al. 2403.11694v1 null
2024-03-19 MoreStyle: Relax Low-frequency Constraint of Fourier-based Image Reconstruction in Generalizable Medical Image Segmentation Haoyu Zhao et.al. 2403.11689v2 null
2024-03-18 Better (pseudo-)labels for semi-supervised instance segmentation François Porcher et.al. 2403.11675v1 null
2024-03-19 WIA-LD2ND: Wavelet-based Image Alignment for Self-supervised Low-Dose CT Denoising Haoyu Zhao et.al. 2403.11672v2 null
2024-03-18 Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection Julia Wolleb et.al. 2403.11667v1 null
2024-03-18 Combining Local and Global Perception for Autonomous Navigation on Nano-UAVs Lorenzo Lamberti et.al. 2403.11661v1 null
2024-03-18 LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model Yuxin Cao et.al. 2403.11656v1 null
2024-03-15 Strong and Controllable Blind Image Decomposition Zeyu Zhang et.al. 2403.10520v1 link
2024-03-15 Frozen Feature Augmentation for Few-Shot Image Classification Andreas Bär et.al. 2403.10519v1 null
2024-03-15 VideoAgent: Long-form Video Understanding with Large Language Model as Agent Xiaohan Wang et.al. 2403.10517v1 null
2024-03-15 Surveyor: Facilitating Discovery Within Video Games for Blind and Low Vision Players Vishnu Nair et.al. 2403.10512v1 null
2024-03-15 Benchmarking Zero-Shot Robustness of Multimodal Foundation Models: A Pilot Study Chenguang Wang et.al. 2403.10499v1 link
2024-03-15 Joint Multimodal Transformer for Dimensional Emotional Recognition in the Wild Paul Waligora et.al. 2403.10488v1 null
2024-03-15 Tensor Star Decomposition Wuyang Zhou et.al. 2403.10481v1 null
2024-03-15 Using an LLM to Turn Sign Spottings into Spoken Language Sentences Ozge Mercanoglu Sincan et.al. 2403.10434v1 null
2024-03-15 Neural Networks Hear You Loud And Clear: Hearing Loss Compensation Using Deep Neural Networks Peter Leer et.al. 2403.10420v1 null
2024-03-15 A comparative study on machine learning approaches for rock mass classification using drilling data Tom F. Hansen et.al. 2403.10404v1 null
2024-03-14 Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models Akhil Kedia et.al. 2403.09635v1 link
2024-03-14 Generalized Predictive Model for Autonomous Driving Jiazhi Yang et.al. 2403.09630v1 link
2024-03-14 From the Conformal Anomaly to the Virasoro Algebra Sid Maibach et.al. 2403.09628v1 null
2024-03-14 Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding Guo Chen et.al. 2403.09626v1 link
2024-03-14 Score-Guided Diffusion for 3D Human Recovery Anastasis Stathopoulos et.al. 2403.09623v1 link
2024-03-14 PosSAM: Panoptic Open-vocabulary Segment Anything Vibashan VS et.al. 2403.09620v1 null
2024-03-14 Explore In-Context Segmentation via Latent Diffusion Models Chaoyang Wang et.al. 2403.09616v1 null
2024-03-14 Compute-first optical detection for noise-resilient visual perception Jungmin Kim et.al. 2403.09612v1 null
2024-03-14 Mixture of Mixups for Multi-label Classification of Rare Anuran Sounds Ilyass Moummad et.al. 2403.09598v1 link
2024-03-14 DungeonMaker: Embedding Tangible Creation and Destruction in Hybrid Board Games through Personal Fabrication Technology Evgeny Stemasov et.al. 2403.09592v1 null
2024-03-13 VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Enric Corona et.al. 2403.08764v1 null
2024-03-13 Segmentation of Knee Bones for Osteoarthritis Assessment: A Comparative Analysis of Supervised, Few-Shot, and Zero-Shot Learning Approaches Yun Xin Teoh et.al. 2403.08761v1 null
2024-03-13 MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning Jialv Zou et.al. 2403.08760v1 link
2024-03-13 Spatiotemporal Diffusion Model with Paired Sampling for Accelerated Cardiac Cine MRI Shihan Qiu et.al. 2403.08758v1 null
2024-03-13 DAM: Dynamic Adapter Merging for Continual Video QA Learning Feng Cheng et.al. 2403.08755v1 link
2024-03-13 Clinically Feasible Diffusion Reconstruction for Highly-Accelerated Cardiac Cine MRI Shihan Qiu et.al. 2403.08749v1 null
2024-03-13 Torsion pairs, t-structures, and co-t-structures for completions of discrete cluster categories Sofia Franchini et.al. 2403.08735v1 null
2024-03-13 Euclid: Testing photometric selection of emission-line galaxy targets M. S. Cagliari et.al. 2403.08726v1 null
2024-03-13 Diffusion-based Iterative Counterfactual Explanations for Fetal Ultrasound Image Quality Assessment Paraskevas Pegios et.al. 2403.08700v1 null
2024-03-13 Implicit Regularization of Gradient Flow on One-Layer Softmax Attention Heejune Sheen et.al. 2403.08699v1 null
2024-03-12 OPEN TEACH: A Versatile Teleoperation System for Robotic Manipulation Aadhithya Iyer et.al. 2403.07870v1 null
2024-03-12 TeleMoMa: A Modular and Versatile Teleoperation System for Mobile Manipulation Shivin Dass et.al. 2403.07869v1 null
2024-03-12 Iterative Graph Neural Network Enhancement via Frequent Subgraph Mining of Explanations Harish G. Naik et.al. 2403.07849v1 null
2024-03-12 When Eye-Tracking Meets Machine Learning: A Systematic Review on Applications in Medical Image Analysis Sahar Moradizeyveh et.al. 2403.07834v1 null
2024-03-12 DeliGrasp: Inferring Object Mass, Friction, and Compliance with LLMs for Adaptive and Minimally Deforming Grasp Policies William Xie et.al. 2403.07832v1 null
2024-03-12 A geometric model for the module category of a string algebra Karin Baur et.al. 2403.07810v1 null
2024-03-12 BraSyn 2023 challenge: Missing MRI synthesis and the effect of different learning objectives Ivo M. Baltruschat et.al. 2403.07800v1 null
2024-03-12 A robust SVM-based approach with feature selection and outliers detection for classification problems Marta Baldomero-Naranjo et.al. 2403.07753v1 null
2024-03-12 Vision-based Vehicle Re-identification in Bridge Scenario using Flock Similarity Chunfeng Zhang et.al. 2403.07752v1 null
2024-03-12 Harnessing two-photon dissipation for enhanced quantum measurement and control Antoine Marquet et.al. 2403.07744v1 null
2024-03-11 Attention Prompt Tuning: Parameter-efficient Adaptation of Pre-trained Models for Spatiotemporal Modeling Wele Gedara Chaminda Bandara et.al. 2403.06978v1 link
2024-03-12 VideoMamba: State Space Model for Efficient Video Understanding Kunchang Li et.al. 2403.06977v2 link
2024-03-11 Memory-based Adapters for Online 3D Scene Perception Xiuwei Xu et.al. 2403.06974v1 null
2024-03-11 Explainable Transformer Prototypes for Medical Diagnoses Ugur Demir et.al. 2403.06961v1 link
2024-03-11 Quadruped-Frog: Rapid Online Optimization of Continuous Quadruped Jumping Guillaume Bellegarda et.al. 2403.06954v1 null
2024-03-11 Optimizing Latent Graph Representations of Surgical Scenes for Zero-Shot Domain Transfer Siddhant Satyanaik et.al. 2403.06953v1 null
2024-03-11 Advancing Generalizable Remote Physiological Measurement through the Integration of Explicit and Implicit Prior Knowledge Yuting Zhang et.al. 2403.06947v1 link
2024-03-11 Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction Qing Xiao et.al. 2403.06940v1 null
2024-03-11 FocusCLIP: Multimodal Subject-Level Guidance for Zero-Shot Transfer in Human-Centric Tasks Muhammad Saif Ullah Khan et.al. 2403.06904v1 null
2024-03-11 Benign overfitting in leaky ReLU networks with moderate input dimension Kedar Karhadkar et.al. 2403.06903v1 null
2024-03-08 Tell, Don't Show!: Language Guidance Eases Transfer Across Domains in Images and Videos Tarun Kalluri et.al. 2403.05535v1 null
2024-03-08 Tune without Validation: Searching for Learning Rate and Weight Decay on Training Sets Lorenzo Brigato et.al. 2403.05532v1 null
2024-03-08 Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context Machel Reid et.al. 2403.05530v1 null
2024-03-08 Take Your Best Shot: Sampling-Based Next-Best-View Planning for Autonomous Photography & Inspection Shijie Gao et.al. 2403.05477v1 null
2024-03-08 Will GPT-4 Run DOOM? Adrian de Wynter et.al. 2403.05468v1 null
2024-03-08 Evaluating AI and Human Authorship Quality in Academic Writing through Physics Essays Will Yeadon et.al. 2403.05458v1 null
2024-03-08 VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models Yabo Zhang et.al. 2403.05438v1 link
2024-03-08 OmniCount: Multi-label Object Counting with Semantic-Geometric Priors Anindya Mondal et.al. 2403.05435v1 null
2024-03-08 Infinite Translation Surfaces in the Wild Vincent Delecroix et.al. 2403.05424v1 null
2024-03-08 Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery Mubashir Noman et.al. 2403.05419v1 link
2024-03-07 DeepSee: Multidimensional Visualizations of Seabed Ecosystems Adam Coscia et.al. 2403.04761v1 link
2024-03-07 iScore: Visual Analytics for Interpreting How Language Models Automatically Score Summaries Adam Coscia et.al. 2403.04760v1 link
2024-03-07 KnowledgeVIS: Interpreting Language Models by Comparing Fill-in-the-Blank Prompts Adam Coscia et.al. 2403.04758v1 link
2024-03-07 Preliminary Guidelines For Combining Data Integration and Visual Data Analysis Adam Coscia et.al. 2403.04757v1 link
2024-03-07 Photonic probabilistic machine learning using quantum vacuum noise Seou Choi et.al. 2403.04731v1 null
2024-03-07 Analysis of Systems' Performance in Natural Language Processing Competitions Sergio Nava-Muñoz et.al. 2403.04693v1 null
2024-03-07 CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios Qilang Ye et.al. 2403.04640v1 link
2024-03-07 Scalable, Simulation-Guided Compliant Tactile Finger Design Yuxiang Ma et.al. 2403.04638v1 null
2024-03-08 Pix2Gif: Motion-Guided Diffusion for GIF Generation Hitesh Kandala et.al. 2403.04634v2 null
2024-03-07 MedFLIP: Medical Vision-and-Language Self-supervised Fast Pre-Training with Masked Autoencoder Lei Li et.al. 2403.04626v1 null
2024-03-06 3D Diffusion Policy Yanjie Ze et.al. 2403.03954v1 link
2024-03-06 Stop Regressing: Training Value Functions via Classification for Scalable Deep RL Jesse Farebrother et.al. 2403.03950v1 null
2024-03-06 Reconciling Reality through Simulation: A Real-to-Sim-to-Real Approach for Robust Manipulation Marcel Torne et.al. 2403.03949v1 null
2024-03-06 DART: Implicit Doppler Tomography for Radar Novel View Synthesis Tianshu Huang et.al. 2403.03896v1 null
2024-03-06 Joint multi-task learning improves weakly-supervised biomarker prediction in computational pathology Omar S. M. El Nahhas et.al. 2403.03891v1 link
2024-03-06 Hierarchical Diffusion Policy for Kinematics-Aware Multi-Task Robotic Manipulation Xiao Ma et.al. 2403.03890v1 null
2024-03-06 Decoupled Vertical Federated Learning for Practical Training on Vertically Partitioned Data Avi Amalanshu et.al. 2403.03871v1 null
2024-03-06 X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot Learning Simultaneously in Classification Hanzi Xu et.al. 2403.03863v1 link
2024-03-06 ProxNF: Neural Field Proximal Training for High-Resolution 4D Dynamic Image Reconstruction Luke Lozenski et.al. 2403.03860v1 null
2024-03-06 MedMamba: Vision Mamba for Medical Image Classification Yubiao Yue et.al. 2403.03849v1 link
2024-03-05 Extension Theory and Fermionic Strongly Fusion 2-Categories Thibault D. Décoppet et.al. 2403.03211v1 null
2024-03-05 Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Patrick Esser et.al. 2403.03206v1 null
2024-03-05 Behavior Generation with Latent Actions Seungjae Lee et.al. 2403.03181v1 link
2024-03-05 Deep-Learned Compression for Radio-Frequency Signal Classification Armani Rodriguez et.al. 2403.03150v1 null
2024-03-05 Dual Mean-Teacher: An Unbiased Semi-Supervised Framework for Audio-Visual Source Localization Yuxin Guo et.al. 2403.03145v1 link
2024-03-05 Motion-Corrected Moving Average: Including Post-Hoc Temporal Information for Improved Video Segmentation Robert Mendel et.al. 2403.03120v1 null
2024-03-05 Equilibria in Two-Stage Facility Location with Atomic Clients Simon Krogmann et.al. 2403.03114v1 null
2024-03-05 Galaxies in the Zone of Avoidance: Misclassifications using machine learning tools P. Marchant Cortés et.al. 2403.03098v1 null
2024-03-05 Collective self-caging of active filaments in virtual confinement Maximilian Kurjahn et.al. 2403.03093v1 null
2024-03-05 A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives Simone Alberto Peirone et.al. 2403.03037v1 null
2024-03-03 Enhancing Retinal Vascular Structure Segmentation in Images With a Novel Design Two-Path Interactive Fusion Module Model Rui Yang et.al. 2403.01362v1 null
2024-03-02 Improve Cost Efficiency of Active Learning over Noisy Dataset Zan-Kai Chong et.al. 2403.01346v1 null
2024-03-02 An eternal hypersurface flow arising in centro-affine geometry Xinjie Jiang et.al. 2403.01340v1 null
2024-03-02 Image-Based Dietary Assessment: A Healthy Eating Plate Estimation System Assylzhan Izbassar et.al. 2403.01310v1 null
2024-03-02 VNLP: Turkish NLP Package Meliksah Turker et.al. 2403.01309v1 null
2024-03-02 Towards a classification of $p^2$-discriminant ideal twins over number fields Alyson Deines et.al. 2403.01287v1 null
2024-03-02 $π$-systems and the Embedding problem for rank $2$ Kac-Moody Lie algebras Irfan Habib et.al. 2403.01285v1 null
2024-03-02 Fast Low-parameter Video Activity Localization in Collaborative Learning Environments Venkatesh Jatla et.al. 2403.01281v1 null
2024-03-02 Rigidity results for group von Neumann algebras with diffuse center Ionuţ Chifan et.al. 2403.01280v1 null
2024-03-02 Can a Confident Prior Replace a Cold Posterior? Martin Marek et.al. 2403.01272v1 link
2024-02-29 Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers Tsai-Shien Chen et.al. 2402.19479v1 null
2024-02-29 Towards Generalizable Tumor Synthesis Qi Chen et.al. 2402.19470v1 null
2024-02-29 Humanoid Locomotion as Next Token Prediction Ilija Radosavovic et.al. 2402.19469v1 null
2024-03-01 TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning Kate Sanders et.al. 2402.19467v2 null
2024-02-29 Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models Frederik Kunstner et.al. 2402.19449v1 null
2024-02-29 Probing the Information Encoded in Neural-based Acoustic Models of Automatic Speech Recognition Systems Quentin Raymondaud et.al. 2402.19443v1 null
2024-02-29 Pushing the Limits of Cross-Embodiment Learning for Manipulation and Navigation Jonathan Yang et.al. 2402.19432v1 null
2024-02-29 PaECTER: Patent-level Representation Learning using Citation-informed Transformers Mainak Ghosh et.al. 2402.19411v1 null
2024-02-29 Navigating Hallucinations for Reasoning of Unintentional Activities Shresth Grover et.al. 2402.19405v1 null
2024-02-29 A Newborn AGN in a Starforming Galaxy P. Arévalo et.al. 2402.19403v1 null
2024-02-28 Time-efficient filtering of polarimetric data by checking physical realizability of experimental Mueller matrices Tatiana Novikova et.al. 2402.18555v1 null
2024-02-28 Selection of appropriate multispectral camera exposure settings and radiometric calibration methods for applications in phenotyping and precision agriculture Vaishali Swaminathan et.al. 2402.18553v1 null
2024-02-28 Implicit Bias of Next-Token Prediction Christos Thrampoulidis et.al. 2402.18551v1 null
2024-02-28 Defect Detection in Tire X-Ray Images: Conventional Methods Meet Deep Structures Andrei Cozma et.al. 2402.18527v1 null
2024-02-28 Do galaxy mergers prefer under-dense environments? U. Sureshkumar et.al. 2402.18520v1 null
2024-02-28 Log Neural Controlled Differential Equations: The Lie Brackets Make a Difference Benjamin Walker et.al. 2402.18512v1 null
2024-02-28 Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling Mahdi Karami et.al. 2402.18508v1 null
2024-02-28 Detection of Micromobility Vehicles in Urban Traffic Videos Khalil Sabri et.al. 2402.18503v1 link
2024-02-28 Few-Shot Fairness: Unveiling LLM's Potential for Fairness-Aware Classification Garima Chhikara et.al. 2402.18502v1 null
2024-02-28 ROG$_{PL}$: Robust Open-Set Graph Learning via Region-Based Prototype Learning Qin Zhang et.al. 2402.18495v1 null
2024-02-27 Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning Xiaoyu Zhang et.al. 2402.17768v1 null
2024-02-27 Towards Optimal Learning of Language Models Yuxian Gu et.al. 2402.17759v1 null
2024-02-27 An Eye Gaze Heatmap Analysis of Uncertainty Head-Up Display Designs for Conditional Automated Driving Michael A. Gerber et.al. 2402.17751v1 null
2024-02-27 Scaling on-chip photonic neural processors using arbitrarily programmable wave propagation Tatsuhiro Onodera et.al. 2402.17750v1 link
2024-02-27 Linking Order to Strength in Metals Nicolas Argibay et.al. 2402.17728v1 null
2024-02-27 MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation Hanan Gani et.al. 2402.17725v1 link
2024-02-27 Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners Yazhou Xing et.al. 2402.17723v1 null
2024-02-27 Understanding Neural Network Binarization with Forward and Backward Proximal Quantizers Yiwei Lu et.al. 2402.17710v1 null
2024-02-27 NextLevelBERT: Investigating Masked Language Modeling with Higher-Level Representations for Long Documents Tamara Czinczoll et.al. 2402.17682v1 null
2024-02-27 MCF-VC: Mitigate Catastrophic Forgetting in Class-Incremental Learning for Multimodal Video Captioning Huiyu Xiong et.al. 2402.17680v1 null
2024-02-26 Open Your Ears to Take a Look: A State-of-the-Art Report on the Integration of Sonification and Visualization Kajetan Enge et.al. 2402.16558v1 null
2024-02-26 LLM-based Privacy Data Augmentation Guided by Knowledge Distillation with a Distribution Tutor for Medical Text Classification Yiping Song et.al. 2402.16515v1 null
2024-02-26 Photonic Neural Network Fabricated on Thin Film Lithium Niobate for High-Fidelity and Power-Efficient Matrix Computation Yong Zheng et.al. 2402.16513v1 null
2024-02-26 Intelligent Known and Novel Aircraft Recognition -- A Shift from Classification to Similarity Learning for Combat Identification Ahmad Saeed et.al. 2402.16486v1 null
2024-02-26 Edge Detectors Can Make Deep Convolutional Neural Networks More Robust Jin Ding et.al. 2402.16479v1 null
2024-02-26 Autonomous Integration of TSN-unaware Applications with QoS Requirements in TSN Networks Moritz Fluechter et.al. 2402.16454v1 null
2024-02-26 Retrouver l'inventeur-auteur : la lev{é}e d'homonymies d'autorat entre les brevets et les publications scientifiques David Reymond et.al. 2402.16440v1 null
2024-02-26 Improving behavior based authentication against adversarial attack using XAI Dong Qin et.al. 2402.16430v1 null
2024-02-26 Adaptive Online Learning of Separable Path Graph Transforms for Intra-prediction Wen-Yang Lu et.al. 2402.16371v1 null
2024-02-26 DEYO: DETR with YOLO for End-to-End Object Detection Haodong Ouyang et.al. 2402.16370v1 null
2024-02-26 SPINEPS -- Automatic Whole Spine Segmentation of T2-weighted MR images using a Two-Phase Approach to Multi-class Semantic and Instance Segmentation Hendrik Möller et.al. 2402.16368v1 link
2024-02-26 An Integrated Data Processing Framework for Pretraining Foundation Models Yiding Sun et.al. 2402.16358v1 link
2024-02-26 What Text Design Characterizes Book Genres? Daichi Haraguchi et.al. 2402.16356v1 null
2024-02-23 A Comprehensive Survey of Convolutions in Deep Learning: Applications, Challenges, and Future Trends Abolfazl Younesi et.al. 2402.15490v1 null
2024-02-23 Retinotopic Mapping Enhances the Robustness of Convolutional Neural Networks Jean-Nicolas Jérémie et.al. 2402.15480v1 null
2024-02-23 FAIR: Filtering of Automatically Induced Rules Divya Jyoti Bajpai et.al. 2402.15472v1 null
2024-02-23 GROS: A General Robust Aggregation Strategy Alejandro Cholaquidis et.al. 2402.15442v1 null
2024-02-23 Hierarchical Invariance for Robust and Interpretable Vision Tasks at Larger Scales Shuren Qi et.al. 2402.15430v1 link
2024-02-23 ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation Yi Zhang et.al. 2402.15429v1 link
2024-02-23 Understanding Entrainment in Human Groups: Optimising Human-Robot Collaboration from Lessons Learned during Human-Human Collaboration Eike Schneiders et.al. 2402.15427v1 null
2024-02-23 PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning Simon Holk et.al. 2402.15420v1 null
2024-02-23 G-RepsNet: A Fast and General Construction of Equivariant Networks for Arbitrary Matrix Groups Sourya Basu et.al. 2402.15413v1 null
2024-02-23 A Universal Method for Solar Filament Detection from H-alpha Observations using Semi-supervised Deep Learning Andrea Diercke et.al. 2402.15407v1 null
2024-02-22 Link Prediction under Heterophily: A Physics-Inspired Graph Neural Network Approach Andrea Giuseppe Di Francesco et.al. 2402.14802v1 null
2024-02-22 Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis Willi Menapace et.al. 2402.14797v1 null
2024-02-22 Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models Yixuan Ren et.al. 2402.14780v1 null
2024-02-22 Zero-Shot Pediatric Tuberculosis Detection in Chest X-Rays using Self-Supervised Learning Daniel Capellán-Martín et.al. 2402.14741v1 null
2024-02-22 Solitons of the mean curvature flow in $\mathbb{s}^2\times\mathbb{R}$ Rafael López et.al. 2402.14727v1 null
2024-02-22 A Transformer Model for Boundary Detection in Continuous Sign Language Razieh Rastgoo et.al. 2402.14720v1 null
2024-02-22 InfFeed: Influence Functions as a Feedback to Improve the Performance of Subjective Tasks Somnath Banerjee et.al. 2402.14702v1 null
2024-02-22 Big data analytics to classify earthwork-related locations: A Chengdu study Lei Yu et.al. 2402.14698v1 null
2024-02-22 Rethinking Invariance Regularization in Adversarial Training to Improve Robustness-Accuracy Trade-off Futa Waseda et.al. 2402.14648v1 null
2024-02-22 Distributed Radiance Fields for Edge Video Compression and Metaverse Integration in Autonomous Driving Eugen Šlapak et.al. 2402.14642v1 null
2024-02-21 A Simple and Yet Fairly Effective Defense for Graph Neural Networks Sofiane Ennadir et.al. 2402.13987v1 link
2024-02-21 On modular representations of inner forms of $\mathrm{GL}_n$ over a local non-archimedean field Johannes Droschl et.al. 2402.13969v1 null
2024-02-21 New directions in algebraic statistics: Three challenges from 2023 Yulia Alexandr et.al. 2402.13961v1 null
2024-02-21 On the topological classification of complex plane curve singularities Alberto Fernández-Hernández et.al. 2402.13941v1 null
2024-02-21 Verifying message-passing neural networks via topology-based bounds tightening Christopher Hojny et.al. 2402.13937v1 null
2024-02-21 Tumor segmentation on whole slide images: training or prompting? Huaqian Wu et.al. 2402.13932v1 null
2024-02-21 BenchCloudVision: A Benchmark Analysis of Deep Learning Approaches for Cloud Detection and Segmentation in Remote Sensing Imagery Loddo Fabio et.al. 2402.13918v1 link
2024-02-21 An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach Mohammad Amaz Uddin et.al. 2402.13871v1 null
2024-02-21 RFI-DRUnet: Restoring dynamic spectra corrupted by radio frequency interference -- Application to pulsar observations Xiao Zhang et.al. 2402.13867v1 null
2024-02-21 What we can learn from TikTok through its Research API Francesco Corso et.al. 2402.13855v1 null
2024-02-20 Video ReCap: Recursive Captioning of Hour-Long Videos Md Mohaiminul Islam et.al. 2402.13250v1 null
2024-02-20 SMORE: Similarity-based Hyperdimensional Domain Adaptation for Multi-Sensor Time Series Classification Junyao Wang et.al. 2402.13233v1 null
2024-02-20 A Touch, Vision, and Language Dataset for Multimodal Alignment Letian Fu et.al. 2402.13232v1 null
2024-02-20 NeRF Solves Undersampled MRI Reconstruction Tae Jun Jang et.al. 2402.13226v1 null
2024-02-20 VideoPrism: A Foundational Visual Encoder for Video Understanding Long Zhao et.al. 2402.13217v1 null
2024-02-20 How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena Marco Gaido et.al. 2402.13208v1 null
2024-02-20 A novel image correction method for cloud-affected observations with Imaging Atmospheric Cherenkov Telescopes Natalia Żywucka et.al. 2402.13190v1 null
2024-02-20 UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing Jianhong Bai et.al. 2402.13185v1 null
2024-02-20 DINOBot: Robot Manipulation via Retrieval and Alignment with Vision Foundation Models Norman Di Palo et.al. 2402.13181v1 null
2024-02-20 3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data Zhi-Yi Lin et.al. 2402.13172v1 null
2024-02-19 Short-Period Variables in TESS Full-Frame Image Light Curves Identified via Convolutional Neural Networks Greg Olmschenk et.al. 2402.12369v1 null
2024-02-19 The first all-sky survey of star-forming galaxies with eROSITA: Scaling relations and a population of X-ray luminous starbursts E. Kyritsis et.al. 2402.12367v1 null
2024-02-19 An Adversarial Approach to Evaluating the Robustness of Event Identification Models Obai Bahwal et.al. 2402.12338v1 null
2024-02-19 Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models Christian Schlarmann et.al. 2402.12336v1 link
2024-02-19 Generating Survival Interpretable Trajectories and Data Andrei V. Konstantinov et.al. 2402.12331v1 null
2024-02-19 Asymptotic Gaussian Fluctuations of Eigenvectors in Spectral Clustering Hugo Lebeau et.al. 2402.12302v1 null
2024-02-19 Time-periodic behaviour in one- and two-dimensional interacting particle systems Jonas Köppl et.al. 2402.12300v1 null
2024-02-19 Is Open-Source There Yet? A Comparative Study on Commercial and Open-Source LLMs in Their Ability to Label Chest X-Ray Reports Felix J. Dorfner et.al. 2402.12298v1 null
2024-02-19 Revisiting registration-based synthesis: A focus on unsupervised MR image synthesis Savannah P. Hays et.al. 2402.12288v1 null
2024-02-19 Significance of Chirp MFCC as a Feature in Speech and Audio Applications S. Johanan Joysingh et.al. 2402.12239v1 null
2024-02-16 PaLM2-VAdapter: Progressively Aligned Language Model Makes a Strong Vision-language Adapter Junfei Xiao et.al. 2402.10896v1 null
2024-02-16 Fusion of Diffusion Weighted MRI and Clinical Data for Predicting Functional Outcome after Acute Ischemic Stroke with Deep Contrastive Learning Chia-Ling Tsai et.al. 2402.10894v1 null
2024-02-16 Weak-Mamba-UNet: Visual Mamba Makes CNN and ViT Work Better for Scribble-based Medical Image Segmentation Ziyang Wang et.al. 2402.10887v1 link
2024-02-16 Control Color: Multimodal Diffusion-based Interactive Image Colorization Zhexin Liang et.al. 2402.10855v1 null
2024-02-16 HistoSegCap: Capsules for Weakly-Supervised Semantic Segmentation of Histological Tissue Type in Whole Slide Images Mobina Mansoori et.al. 2402.10851v1 null
2024-02-16 FedD2S: Personalized Data-Free Federated Knowledge Distillation Kawa Atapour et.al. 2402.10846v1 null
2024-02-16 Pedipulate: Enabling Manipulation Skills using a Quadruped Robot's Leg Philip Arm et.al. 2402.10837v1 null
2024-02-16 GAN-driven Electromagnetic Imaging of 2-D Dielectric Scatterers Ehtasham Naseer et.al. 2402.10831v1 null
2024-02-16 Structure results for torus fixed loci Jarod Alper et.al. 2402.10823v1 null
2024-02-16 Training Class-Imbalanced Diffusion Model Via Overlap Optimization Divin Yan et.al. 2402.10821v1 link
2024-02-15 Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling Raunaq Bhirangi et.al. 2402.10211v1 null
2024-02-15 FedAnchor: Enhancing Federated Semi-Supervised Learning with Label Contrastive Loss for Unlabeled Clients Xinchi Qiu et.al. 2402.10191v1 null
2024-02-15 Euclid preparation. Measuring detailed galaxy morphologies for Euclid with Machine Learning Euclid Collaboration et.al. 2402.10187v1 link
2024-02-15 DeepSRGM -- Sequence Classification and Ranking in Indian Classical Music with Deep Learning Sathwik Tejaswi Madhusudhan et.al. 2402.10168v1 null
2024-02-15 Holographic covering and the fortuity of black holes Chi-Ming Chang et.al. 2402.10129v1 null
2024-02-15 Classification Diffusion Models Shahar Yadin et.al. 2402.10095v1 null
2024-02-15 MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations Benedikt Alkin et.al. 2402.10093v1 link
2024-02-15 GraphCBAL: Class-Balanced Active Learning for Graph Neural Networks via Reinforcement Learning Chengcheng Yu et.al. 2402.10074v1 null
2024-02-15 Both Matter: Enhancing the Emotional Intelligence of Large Language Models without Compromising the General Intelligence Weixiang Zhao et.al. 2402.10073v1 null
2024-02-15 NYCTALE: Neuro-Evidence Transformer for Adaptive and Personalized Lung Nodule Invasiveness Prediction Sadaf Khademi et.al. 2402.10066v1 null
2024-02-14 LL-GABR: Energy Efficient Live Video Streaming Using Reinforcement Learning Adithya Raman et.al. 2402.09392v1 null
2024-02-14 GraSSRep: Graph-Based Self-Supervised Learning for Repeat Detection in Metagenomic Assembly Ali Azizpour et.al. 2402.09381v1 link
2024-02-14 Deep Rib Fracture Instance Segmentation and Classification from CT on the RibFrac Challenge Jiancheng Yang et.al. 2402.09372v1 null
2024-02-14 Magic-Me: Identity-Specific Video Customized Diffusion Ze Ma et.al. 2402.09368v1 null
2024-02-14 Small instanton-induced flavor invariants and the axion potential Ravneet Bedi et.al. 2402.09361v1 null
2024-02-14 Pruning Sparse Tensor Neural Networks Enables Deep Learning for 3D Ultrasound Localization Microscopy Brice Rauby et.al. 2402.09359v1 null
2024-02-14 DoRA: Weight-Decomposed Low-Rank Adaptation Shih-Yang Liu et.al. 2402.09353v1 null
2024-02-14 Irreducible representations of the crystallisation of the $C^{*}$-algebra $C(SU_{q}(n+1))$ Manabendra Giri et.al. 2402.09347v1 null
2024-02-14 Registration of Longitudinal Spine CTs for Monitoring Lesion Growth Malika Sanhinova et.al. 2402.09341v1 null
2024-02-14 Stability and Multigroup Fairness in Ranking with Uncertain Predictions Siddartha Devic et.al. 2402.09326v1 null
2024-02-13 IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation Luke Melas-Kyriazi et.al. 2402.08682v1 null
2024-02-13 A Convergence Analysis of Approximate Message Passing with Non-Separable Functions and Applications to Multi-Class Classification Burak Çakmak et.al. 2402.08676v1 null
2024-02-13 Learning Emergent Gaits with Decentralized Phase Oscillators: on the role of Observations, Rewards, and Feedback Jenny Zhang et.al. 2402.08662v1 null
2024-02-13 BdSLW60: A Word-Level Bangla Sign Language Dataset Husne Ara Rubaiyeat et.al. 2402.08635v1 link
2024-02-13 Convolutional Neural Networks Towards Facial Skin Lesions Detection Reza Sarshar et.al. 2402.08592v1 null
2024-02-13 Totally geodesic submanifolds and polar actions on Stiefel manifolds Claudio Gorodski et.al. 2402.08585v1 null
2024-02-13 Motion-Adaptive Inference for Flexible Learned B-Frame Compression M. Akin Yilmaz et.al. 2402.08550v1 null
2024-02-13 Approximately Piecewise E(3) Equivariant Point Networks Matan Atzmon et.al. 2402.08529v1 null
2024-02-13 Reduced-order modeling of the dynamics of an inverted flag from experimental data Zhenwei Xu et.al. 2402.08504v1 null
2024-02-13 Intriguing Differences Between Zero-Shot and Systematic Evaluations of Vision-Language Transformer Models Shaeke Salman et.al. 2402.08473v1 null
2024-02-13 Wavefront Randomization Improves Deconvolution Amit Kohli et.al. 2402.07900v2 null
2024-02-12 Detection of Spider Mites on Labrador Beans through Machine Learning Approaches Using Custom Datasets Violet Liu et.al. 2402.07895v1 null
2024-02-12 Perfect stable regularity lemma and slice-wise stable hypergraphs Artem Chernikov et.al. 2402.07870v1 null
2024-02-12 On Computationally Efficient Multi-Class Calibration Parikshit Gopalan et.al. 2402.07821v1 null
2024-02-12 A Benchmark Grocery Dataset of Realworld Point Clouds From Single View Shivanand Venkanna Sheshappanavar et.al. 2402.07819v1 null
2024-02-12 Fixation for $\mathcal{U}$-Ising and $\mathcal{U}$-voter dynamics with frozen vertices Laure Marêché et.al. 2402.07807v1 null
2024-02-12 Estimation of non-uniform blur using a patch-based regression convolutional neural network (CNN) Luis G. Varela et.al. 2402.07796v1 null
2024-02-12 "Layer-by-layer" Unsupervised Clustering of Statistically Relevant Fluctuations in Noisy Time-series Data of Complex Dynamical Systems Matteo Becchi et.al. 2402.07786v1 null
2024-02-12 Solving parameter-dependent semi-algebraic systems Louis Gaillard et.al. 2402.07782v1 null
2024-02-12 Observations of the new meteor shower from comet 46P/Wirtanen D. Vida et.al. 2402.07769v1 null
2024-02-09 A two-stage algorithm in evolutionary product unit neural networks for classification Antonio J. Tallón-Ballesteros et.al. 2402.06622v1 null
2024-02-09 Image-based Deep Learning for the time-dependent prediction of fresh concrete properties Max Meyer et.al. 2402.06611v1 null
2024-02-09 SAE: Single Architecture Ensemble Neural Networks Martin Ferianc et.al. 2402.06580v1 null
2024-02-09 Video Annotator: A framework for efficiently building video classifiers using vision-language models and active learning Amir Ziai et.al. 2402.06560v1 link
2024-02-09 Self Supervised Learning for Improved Calibrationless Radial MRI with NLINV-Net Moritz Blumenthal et.al. 2402.06550v1 null
2024-02-09 Bryndza at ClimateActivism 2024: Stance, Target and Hate Event Detection via Retrieval-Augmented GPT-4 and LLaMA Marek Šuppa et.al. 2402.06549v1 null
2024-02-09 Feature Density Estimation for Out-of-Distribution Detection via Normalizing Flows Evan D. Cook et.al. 2402.06537v1 null
2024-02-09 Refining Myocardial Infarction Detection: A Novel Multi-Modal Composite Kernel Strategy in One-Class Classification Muhammad Uzair Zahid et.al. 2402.06530v1 null
2024-02-09 Flexible infinite-width graph convolutional networks and the importance of representation learning Ben Anson et.al. 2402.06525v1 null
2024-02-09 Dynamic swarms regulate the morphology and distribution of soft membrane domains Aakanksha Gubbala et.al. 2402.06518v1 null
2024-02-08 Classifying Nodes in Graphs without GNNs Daniel Winter et.al. 2402.05934v1 link
2024-02-08 An Interactive Agent Foundation Model Zane Durante et.al. 2402.05929v1 null
2024-02-08 Point-VOS: Pointing Up Video Object Segmentation Idil Esen Zulfikar et.al. 2402.05917v1 null
2024-02-08 A Survey on Detection, Classification, and Tracking of Aerial Threats using Radar and Communications Systems Wahab Khawaja et.al. 2402.05909v1 null
2024-02-09 Large Language Model Meets Graph Neural Network in Knowledge Distillation Shengxiang Hu et.al. 2402.05894v2 null
2024-02-08 Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data Shufan Li et.al. 2402.05892v1 null
2024-02-08 CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion Shoubin Yu et.al. 2402.05889v1 null
2024-02-08 Sandwiched Compression: Repurposing Standard Codecs with Neural Network Wrappers Onur G. Guleryuz et.al. 2402.05887v1 link
2024-02-08 GET-Tok: A GenAI-Enriched Multimodal TikTok Dataset Documenting the 2022 Attempted Coup in Peru Gabriela Pinto et.al. 2402.05882v1 link
2024-02-08 You've Got to Feel It To Believe It: Multi-Modal Bayesian Inference for Semantic and Property Prediction Parker Ewen et.al. 2402.05872v1 null
2024-02-07 Edu-ConvoKit: An Open-Source Library for Education Conversation Data Rose E. Wang et.al. 2402.05111v1 link
2024-02-07 Moduli Parameters of Complex Singularities with Non-Degenerate Newton Boundary Janko Boehm et.al. 2402.05093v1 null
2024-02-07 Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation Ziyang Wang et.al. 2402.05079v1 link
2024-02-07 Arbitrary Scale Super-Resolution Assisted Lunar Crater Detection in Satellite Images Atal Tewari et.al. 2402.05068v1 null
2024-02-07 Efficient Multi-Resolution Fusion for Remote Sensing Data with Label Uncertainty Hersh Vakharia et.al. 2402.05045v1 link
2024-02-07 PAC Learnability under Explanation-Preserving Graph Perturbations Xu Zheng et.al. 2402.05039v1 null
2024-02-07 Strong convexity-guided hyper-parameter optimization for flatter losses Rahul Yedida et.al. 2402.05025v1 null
2024-02-07 Example-based Explanations for Random Forests using Machine Unlearning Tanmay Surve et.al. 2402.05007v1 null
2024-02-07 Randomized Confidence Bounds for Stochastic Partial Monitoring Maxime Heuillet et.al. 2402.05002v1 null
2024-02-07 Beyond explaining: XAI-based Adaptive Learning with SHAP Clustering for Energy Consumption Prediction Tobias Clement et.al. 2402.04982v1 null
2024-02-06 EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters Quan Sun et.al. 2402.04252v1 link
2024-02-06 The spectrum of excisive functors Gregory Arone et.al. 2402.04244v1 null
2024-02-06 A classification of nonzero skew immaculate functions Sarah Mason et.al. 2402.04219v1 null
2024-02-06 Resource-Aware Hierarchical Federated Learning in Wireless Video Caching Networks Md Ferdous Pervej et.al. 2402.04216v1 null
2024-02-06 "Task Success" is not Enough: Investigating the Use of Video-Language Models as Behavior Critics for Catching Undesirable Agent Behaviors Lin Guan et.al. 2402.04210v1 null
2024-02-06 3D Volumetric Super-Resolution in Radiology Using 3D RRDB-GAN Juhyung Ha et.al. 2402.04171v1 null
2024-02-06 Human Emotions Analysis and Recognition Using EEG Signals in Response to 360$^\circ$ Videos Haseeb ur Rahman Abbasi et.al. 2402.04142v1 null
2024-02-06 Hierarchical Delay Attribution Classification using Unstructured Text in Train Management Systems Anton Borg et.al. 2402.04108v1 null
2024-02-06 Analysis of Deep Image Prior and Exploiting Self-Guidance for Image Reconstruction Shijun Liang et.al. 2402.04097v1 null
2024-02-06 A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation Zhengbo Wang et.al. 2402.04087v1 link
2024-02-05 Multiclass Classification Procedure for Detecting Attacks on MQTT-IoT Protocol Hector Alaiz-Moreton et.al. 2402.03270v1 null
2024-02-05 Security Advice for Parents and Children About Content Filtering and Circumvention as Found on YouTube and TikTok Ran Elgedawy et.al. 2402.03255v1 null
2024-02-05 JOBSKAPE: A Framework for Generating Synthetic Job Postings to Enhance Skill Matching Antoine Magron et.al. 2402.03242v1 link
2024-02-05 FROSTER: Frozen CLIP Is A Strong Teacher for Open-Vocabulary Action Recognition Xiaohu Huang et.al. 2402.03241v1 null
2024-02-05 IGUANe: a 3D generalizable CycleGAN for multicenter harmonization of brain MR images Vincent Roca et.al. 2402.03227v1 null
2024-02-05 English Prompts are Better for NLI-based Zero-Shot Emotion Classification than Target-Language Prompts Patrick Barreiß et.al. 2402.03223v1 null
2024-02-05 "Define Your Terms" : Enhancing Efficient Offensive Speech Classification with Definition Huy Nghiem et.al. 2402.03221v1 link
2024-02-05 Isotropy, Clusters, and Classifiers Timothee Mickus et.al. 2402.03191v1 null
2024-02-06 Cool-chic video: Learned video coding with 800 parameters Thomas Leguay et.al. 2402.03179v2 null
2024-02-05 Accurate and Well-Calibrated ICD Code Assignment Through Attention Over Diverse Label Embeddings Gonçalo Gomes et.al. 2402.03172v1 link
2024-02-02 From gas to stars: MUSEings on the internal evolution of IC 1613 S. Taibi et.al. 2402.01631v1 null
2024-02-02 Truncation technique for variational quantum eigensolver for Molecular Hamiltonians Qidong Xu et.al. 2402.01630v1 null
2024-02-02 L2G2G: a Scalable Local-to-Global Network Embedding with Graph Autoencoders Ruikang Ouyang et.al. 2402.01614v1 link
2024-02-02 Immersive Video Compression using Implicit Neural Representations Ho Man Kwan et.al. 2402.01596v1 link
2024-02-02 NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties Jingyuan Sun et.al. 2402.01590v1 null
2024-02-02 Boximator: Generating Rich and Controllable Motions for Video Synthesis Jiawei Wang et.al. 2402.01566v1 null
2024-02-02 Deep Continuous Networks Nergis Tomen et.al. 2402.01557v1 link
2024-02-02 SLYKLatent, a Learning Framework for Facial Features Estimation Samuel Adebayo et.al. 2402.01555v1 null
2024-02-02 Advancing Brain Tumor Inpainting with Generative Models Ruizhi Zhu et.al. 2402.01509v1 null
2024-02-02 Di-NeRF: Distributed NeRF for Collaborative Learning with Unknown Relative Poses Mahboubeh Asadi et.al. 2402.01485v1 null
2024-02-01 We're Not Using Videos Effectively: An Updated Domain Adaptive Video Segmentation Baseline Simar Kareer et.al. 2402.00868v1 link
2024-02-01 Deep Room Impulse Response Completion Jackie Lin et.al. 2402.00859v1 null
2024-02-01 Early Time Classification with Accumulated Accuracy Gap Control Liran Ringel et.al. 2402.00857v1 link
2024-02-01 BootsTAP: Bootstrapped Training for Tracking-Any-Point Carl Doersch et.al. 2402.00847v1 link
2024-02-01 Emo-Avatar: Efficient Monocular Video Style Avatar through Texture Rendering Pinxin Liu et.al. 2402.00827v1 null
2024-02-01 Examining the Influence of Digital Phantom Models in Virtual Imaging Trials for Tomographic Breast Imaging Amar Kavuri et.al. 2402.00812v1 null
2024-02-01 ReAGent: Towards A Model-agnostic Feature Attribution Method for Generative Language Models Zhixue Zhao et.al. 2402.00794v1 link
2024-02-01 Distinguishing the Indistinguishable: Human Expertise in Algorithmic Prediction Rohan Alur et.al. 2402.00793v1 link
2024-02-02 CroissantLLM: A Truly Bilingual French-English Language Model Manuel Faysse et.al. 2402.00786v2 link
2024-02-01 Hybrid Quantum Vision Transformers for Event Classification in High Energy Physics Eyup B. Unlu et.al. 2402.00776v1 null
2024-01-31 Classification-Oriented Semantic Wireless Communications Emrecan Kutay et.al. 2401.18069v1 null
2024-01-31 Rank Supervised Contrastive Learning for Time Series Classification Qianying Ren et.al. 2401.18057v1 null
2024-01-31 Variable selection for Naïve Bayes classification Rafael Blanquero et.al. 2401.18039v1 null
2024-01-31 Optimizing contrastive learning for cortical folding pattern detection Aymeric Gaudin et.al. 2401.18035v1 null
2024-01-31 A Neural Enhancement Post-Processor with a Dynamic AV1 Encoder Configuration Strategy for CLIC 2024 Darren Ramsook et.al. 2401.18021v1 null
2024-01-31 EEG-GPT: Exploring Capabilities of Large Language Models for EEG Classification and Interpretation Jonathan W. Kim et.al. 2401.18006v1 null
2024-01-31 Unsupervised Learning of Topological Non-Abelian Braiding in Non-Hermitian Bands Yang Long et.al. 2401.17968v1 null
2024-01-31 Error-Tolerant E-Discovery Protocols Jinshuo Dong et.al. 2401.17952v1 null
2024-01-31 HyperZ$\cdot$Z$\cdot$W Operator Connects Slow-Fast Networks for Full Context Interaction Harvie Zhang et.al. 2401.17948v1 null
2024-01-31 Probabilistic Photonic Computing with Chaotic Light Frank Brückerhoff-Plückelmann et.al. 2401.17915v1 null
2024-01-30 The SRG/eROSITA all-sky survey: Hard X-ray selected Active Galactic Nuclei Sophia G. H. Waddell et.al. 2401.17306v1 null
2024-01-30 Compact white-dwarf binaries in the combined SRG/eROSITA/SDSS eFEDS survey A. Schwope et.al. 2401.17304v1 null
2024-01-30 Searching for X-ray counterparts of unassociated Fermi-LAT sources and rotation-powered pulsars with SRG/eROSITA Martin G. F. Mayer et.al. 2401.17295v1 null
2024-01-30 X-ray AGNs with SRG/eROSITA: Multi-wavelength observations reveal merger triggering and post-coalescence circumnuclear blowout Robert W. Bickley et.al. 2401.17277v1 null
2024-01-30 ReacLLaMA: Merging chemical and textual information in chemical reactivity AI models Aline Hartgers et.al. 2401.17267v1 null
2024-01-30 SLIC: A Learned Image Codec Using Structure and Color Srivatsa Prativadibhayankaram et.al. 2401.17246v1 link
2024-01-31 Faster coloring and embedding in dense hypergraphs via stability Jianfeng Hou et.al. 2401.17219v2 null
2024-01-31 GazeGPT: Augmenting Human Capabilities using Gaze-contingent Contextual AI for Smart Eyewear Robert Konrad et.al. 2401.17217v2 null
2024-01-30 Single Word Change is All You Need: Designing Attacks and Defenses for Text Classifiers Lei Xu et.al. 2401.17196v1 null
2024-01-30 GraphViz2Vec: A Structure-aware Feature Generation Model to Improve Classification in GNNs Shraban Kumar Chatterjee et.al. 2401.17178v1 null
2024-01-29 Computer Vision for Primate Behavior Analysis in the Wild Richard Vogg et.al. 2401.16424v1 null
2024-01-29 Synchformer: Efficient Synchronization from Sparse Cues Vladimir Iashin et.al. 2401.16423v1 null
2024-01-29 Strategic Usage in a Multi-Learner Setting Eliot Shekhtman et.al. 2401.16422v1 null
2024-01-29 ReTaSA: A Nonparametric Functional Estimation Approach for Addressing Continuous Target Shift Hwanwoo Kim et.al. 2401.16410v1 null
2024-01-29 Is K-fold cross validation the best model selection method for Machine Learning? Juan M Gorriz et.al. 2401.16407v1 null
2024-01-29 Zero-shot Imitation Policy via Search in Demonstration Dataset Federco Malato et.al. 2401.16398v1 null
2024-01-29 Ovarian Cancer Diagnostics using Wavelet Packet Scaling Descriptors Raymond J. Hinton Jr. et.al. 2401.16396v1 null
2024-01-29 Evaluation of pseudo-healthy image reconstruction for anomaly detection with deep generative models: Application to brain FDG PET Ravi Hassanaly et.al. 2401.16363v1 link
2024-01-29 Curriculum-Based Reinforcement Learning for Quadrupedal Jumping: A Reference-free Design Vassil Atanassov et.al. 2401.16337v1 null
2024-01-29 Making the unmodulated Pyramid wavefront sensor smart. Closed-loop demonstration of neural network wavefront reconstruction with MagAO-X Rico Landman et.al. 2401.16325v1 null
2024-01-26 From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities Chaochao Lu et.al. 2401.15071v1 null
2024-01-26 Deep learning-based approach for tomato classification in complex scenes Mikael A. Mousse et.al. 2401.15055v1 null
2024-01-26 Non-Unitary $3 \times 3$ Mixing in Majorana Neutrinos and Vector-like Quark Models Pedro M. F. Pereira et.al. 2401.15049v1 null
2024-01-26 Machine learning-based analysis of glioma tissue sections: a review Jan-Philipp Redlich et.al. 2401.15022v1 null
2024-01-26 Enhancement of a Text-Independent Speaker Verification System by using Feature Combination and Parallel-Structure Classifiers Kerlos Atia Abdalmalak et.al. 2401.15018v1 null
2024-01-26 Graph-based Active Learning for Entity Cluster Repair Victor Christen et.al. 2401.14992v1 null
2024-01-26 Stokes graphs of the Rabi problem with real parameters René Langøen et.al. 2401.14991v1 null
2024-01-26 Minimum-dissipation principle for synchronised stochastic oscillators far from equilibrium Jan Meibohm et.al. 2401.14982v1 null
2024-01-26 Microwave lymphedema assessment using deep learning with contour assisted backprojection Yuyi Chang et.al. 2401.14970v1 null
2024-01-26 Hold Tight: Identifying Behavioral Patterns During Prolonged Work in VR through Video Analysis Verena Biener et.al. 2401.14920v1 null
2024-01-25 Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities Yiyuan Zhang et.al. 2401.14405v1 link
2024-01-25 Adaptive Mobile Manipulation for Articulated Objects In the Open World Haoyu Xiong et.al. 2401.14403v1 null
2024-01-25 Range-Agnostic Multi-View Depth Estimation With Keyframe Selection Andrea Conti et.al. 2401.14401v1 link
2024-01-25 Rethinking Patch Dependence for Masked Autoencoders Letian Fu et.al. 2401.14391v1 null
2024-01-25 Smooth Ranking SVM via Cutting-Plane Method Erhan Can Ozcan et.al. 2401.14388v1 link
2024-01-25 Inconsistency Masks: Removing the Uncertainty from Input-Pseudo-Label Pairs Michael R. H. Vorndran et.al. 2401.14387v1 link
2024-01-25 A Comparative Analysis of Noise Reduction Methods in Sentiment Analysis on Noisy Bengali Texts Kazi Toufique Elahi et.al. 2401.14360v1 link
2024-01-25 Computing Derivations on Nilpotent Quadratic Lie Algebras Pilar Benito et.al. 2401.14348v1 null
2024-01-25 Class-attribute Priors: Adapting Optimization to Heterogeneity and Fairness Objective Xuechen Zhang et.al. 2401.14343v1 null
2024-01-25 Progressive Multi-task Anti-Noise Learning and Distilling Frameworks for Fine-grained Vehicle Recognition Dichao Liu et.al. 2401.14336v1 link
2024-01-24 Tyche: Stochastic In-Context Learning for Medical Image Segmentation Marianne Rakic et.al. 2401.13650v1 null
2024-01-24 Quantifying the Impact of Frame Preemption on Combined TSN Shapers Rubi Debnath et.al. 2401.13631v1 null
2024-01-24 Can overfitted deep neural networks in adversarial training generalize? -- An approximation viewpoint Zhongjie Shi et.al. 2401.13624v1 null
2024-01-24 FLLIC: Functionally Lossless Image Compression Xi Zhang et.al. 2401.13616v1 null
2024-01-24 Enhancing Image Retrieval : A Comprehensive Study on Photo Search using the CLIP Mode Naresh Kumar Lahajal et.al. 2401.13613v1 null
2024-01-24 Prompt Weight Experiments for LLM Instruction Fine-Tuning Mathew Huerta-Enochian et.al. 2401.13586v1 null
2024-01-24 WPDA: Frequency-based Backdoor Attack with Wavelet Packet Decomposition Zhengyao Song et.al. 2401.13578v1 null
2024-01-24 CNN architecture extraction on edge GPU Peter Horvath et.al. 2401.13575v1 null
2024-01-24 Benchmarking the Fairness of Image Upsampling Methods Mike Laszkiewicz et.al. 2401.13555v1 null
2024-01-24 PanAf20K: A Large Video Dataset for Wild Ape Detection and Behaviour Recognition Otto Brookes et.al. 2401.13554v1 null
2024-01-23 SegmentAnyBone: A Universal Model that Segments Any Bone at Any Location on MRI Hanxue Gu et.al. 2401.12974v1 null
2024-01-23 On the Efficacy of Text-Based Input Modalities for Action Anticipation Apoorva Beedu et.al. 2401.12972v1 null
2024-01-23 The role of environment and AGN feedback in quenching local galaxies: Comparing cosmological hydrodynamical simulations to the SDSS Paul H. Goubert et.al. 2401.12953v1 null
2024-01-23 Lumiere: A Space-Time Diffusion Model for Video Generation Omer Bar-Tal et.al. 2401.12945v1 null
2024-01-23 Long-range three-dimensional tracking of nanoparticles using interferometric scattering (iSCAT) microscopy Kiarash Kasaian et.al. 2401.12939v1 null
2024-01-23 Neural deformation fields for template-based reconstruction of cortical surfaces from MRI Fabian Bongratz et.al. 2401.12938v1 null
2024-01-23 Segmentation of tibiofemoral joint tissues from knee MRI using MtRA-Unet and incorporating shape information: Data from the Osteoarthritis Initiative Akshay Daydar et.al. 2401.12932v1 null
2024-01-23 pyAKI - An Open Source Solution to Automated KDIGO classification Christian Porschen et.al. 2401.12930v1 null
2024-01-23 Performance Analysis of Support Vector Machine (SVM) on Challenging Datasets for Forest Fire Detection Ankan Kar et.al. 2401.12924v1 null
2024-01-23 Advancing Glitch Classification in Gravity Spy: Multi-view Fusion with Attention-based Machine Learning for Advanced LIGO's Fourth Observing Run Yunan Wu et.al. 2401.12913v1 null
2024-01-22 Connecting the Dots: Leveraging Spatio-Temporal Graph Neural Networks for Accurate Bangla Sign Language Recognition Haz Sameen Shahgir et.al. 2401.12210v1 null
2024-01-22 Unsupervised Machine Learning for the Classification of Astrophysical X-ray Sources Víctor Samuel Pérez-Díaz et.al. 2401.12203v1 link
2024-01-22 OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics Peiqi Liu et.al. 2401.12202v1 null
2024-01-22 In-Context Learning for Extreme Multi-Label Classification Karel D'Oosterlinck et.al. 2401.12178v1 null
2024-01-22 Broiler-Net: A Deep Convolutional Framework for Broiler Behavior Analysis in Poultry Houses Tahereh Zarrat Ehsan et.al. 2401.12176v1 link
2024-01-22 VRMN-bD: A Multi-modal Natural Behavior Dataset of Immersive Human Fear Responses in VR Stand-up Interactive Games He Zhang et.al. 2401.12133v1 link
2024-01-22 Evaluation of QCNN-LSTM for Disability Forecasting in Multiple Sclerosis Using Sequential Multisequence MRI John D. Mayfield et.al. 2401.12132v1 null
2024-01-22 Out-of-Distribution Detection & Applications With Ablated Learned Temperature Energy Will LeVine et.al. 2401.12129v1 link
2024-01-22 Measures of the Capital Network of the U.S. Economy Ben Klemens et.al. 2401.12118v1 null
2024-01-22 A quantitative version of the Steinhaus theorem Alex Iosevich et.al. 2401.12112v1 null
2024-01-19 Classifying affine structures with focus-focus singularities Xiudi Tang et.al. 2401.10881v1 null
2024-01-19 Motion Consistency Loss for Monocular Visual Odometry with Attention-Based Deep Learning André O. Françani et.al. 2401.10857v1 null
2024-01-19 Emotion Classification In Software Engineering Texts: A Comparative Analysis of Pre-trained Transformers Language Models Mia Mohammad Imran et.al. 2401.10845v1 null
2024-01-19 Understanding Video Transformers via Universal Concept Discovery Matthew Kowal et.al. 2401.10831v1 null
2024-01-19 Long-Term Monitoring of the Oe Star VES 735: Ope! Not So Quiet After All Brandon Marshall et.al. 2401.10829v1 null
2024-01-19 ActAnywhere: Subject-Aware Video Background Generation Boxiao Pan et.al. 2401.10822v1 null
2024-01-19 RAD-DINO: Exploring Scalable Medical Image Encoders Beyond Text Supervision Fernando Pérez-García et.al. 2401.10815v1 null
2024-01-19 Learning to Visually Connect Actions and their Effects Eric Peh et.al. 2401.10805v1 null
2024-01-19 Endovascular Detection of Catheter-Thrombus Contact by Vacuum Excitation Jared Lawson et.al. 2401.10804v1 null
2024-01-19 TDC-less Direct Time-of-Flight Imaging Using Spiking Neural Networks Jack MacLean et.al. 2401.10793v1 null
2024-01-18 Simultaneous Tactile Estimation and Control for Extrinsic Dexterity Antonia Bronars et.al. 2401.10230v1 null
2024-01-18 OMG-Seg: Is One Model Good Enough For All Segmentation? Xiangtai Li et.al. 2401.10229v1 link
2024-01-18 RAP-SAM: Towards Real-Time All-Purpose Segment Anything Shilin Xu et.al. 2401.10228v1 link
2024-01-18 Towards Language-Driven Video Inpainting via Multimodal Large Language Models Jianzong Wu et.al. 2401.10226v1 null
2024-01-18 Explaining the Implicit Neural Canvas: Connecting Pixels to Neurons by Tracing their Contributions Namitha Padmanabhan et.al. 2401.10217v1 null
2024-01-18 Transfer Learning in Human Activity Recognition: A Survey Sourish Gunesh Dhekane et.al. 2401.10185v1 null
2024-01-18 SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild Andreas Engelhardt et.al. 2401.10171v1 null
2024-01-19 Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation Changgu Chen et.al. 2401.10150v2 null
2024-01-18 Few-shot learning for COVID-19 Chest X-Ray Classification with Imbalanced Data: An Inter vs. Intra Domain Study Alejandro Galán-Cuenca et.al. 2401.10129v1 null
2024-01-18 Sub2Full: split spectrum to boost OCT despeckling without clean data Lingyun Wang et.al. 2401.10128v1 link
2024-01-17 Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model Lianghui Zhu et.al. 2401.09417v1 link
2024-01-17 Vlogger: Make Your Dream A Vlog Shaobin Zhuang et.al. 2401.09414v1 link
2024-01-17 Deciphering Textual Authenticity: A Generalized Strategy through the Lens of Large Language Semantics for Detecting Human vs. Machine-Generated Text Mazal Bethany et.al. 2401.09407v1 null
2024-01-17 Élivágar: Efficient Quantum Circuit Search for Classification Sashwat Anagolum et.al. 2401.09393v1 null
2024-01-17 Tri$^{2}$-plane: Volumetric Avatar Reconstruction with Feature Pyramid Luchuan Song et.al. 2401.09386v1 link
2024-01-17 New relations of pod partition and its connection with other partition functions Hemjyoti Nath et.al. 2401.09374v1 null
2024-01-17 To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection Luyi Han et.al. 2401.09336v1 link
2024-01-17 Machines Do See Color: A Guideline to Classify Different Forms of Racist Discourse in Large Corpora Diana Davila Gordillo et.al. 2401.09333v1 null
2024-01-17 Spectral Distribution Complexity of the Surface Fibrillatory Waves Predicts Post-Catheter Ablation Relapse in Persistent Atrial Fibrillation Pilar Escribano et.al. 2401.09297v1 null
2024-01-17 T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis Yoonjin Chung et.al. 2401.09294v1 null
2024-01-16 From Coarse to Fine: Efficient Training for Audio Spectrogram Transformers Jiu Feng et.al. 2401.08415v1 null
2024-01-16 Faster ISNet for Background Bias Mitigation on Deep Neural Networks Pedro R. A. S. Bassi et.al. 2401.08409v1 null
2024-01-16 Training and Comparison of nnU-Net and DeepMedic Methods for Autosegmentation of Pediatric Brain Tumors Arastoo Vossough et.al. 2401.08404v1 null
2024-01-16 High-Quality Mesh Blendshape Generation from Face Videos via Neural Inverse Rendering Xin Ming et.al. 2401.08398v1 null
2024-01-16 DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models Zongxin Yang et.al. 2401.08392v1 link
2024-01-16 We don't need no labels: Estimating post-deployment model performance under covariate shift without ground truth Jakub Białek et.al. 2401.08348v1 null
2024-01-16 Learn What You Need in Personalized Federated Learning Kexin Lv et.al. 2401.08327v1 link
2024-01-16 Application of LLM Agents in Recruitment: A Novel Framework for Resume Screening Chengguang Gan et.al. 2401.08315v1 null
2024-01-16 Central extensions of restricted Lie superalgebras and classification of $p$-nilpotent Lie superalgebras in dimension $4$ Sofiane Bouarroudj et.al. 2401.08313v1 null
2024-01-16 Evaluating online elasticity estimation of soft objects using standard robot grippers Shubhan P. Patni et.al. 2401.08298v1 null
2024-01-16 Multitask Learning in Minimally Invasive Surgical Vision: A Review Oluwatosin Alabi et.al. 2401.08256v1 null
2024-01-16 Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization Chongzhi Zhang et.al. 2401.08232v1 null
2024-01-16 Towards Causal Relationship in Indefinite Data: Baseline Model and New Datasets Hang Chen et.al. 2401.08221v1 link
2024-01-16 Ship Detection in SAR Images with Human-in-the-Loop Hecheng Jia et.al. 2401.08213v1 null
2024-01-16 ModelNet-O: A Large-Scale Synthetic Dataset for Occlusion-Aware Point Cloud Classification Zhongbin Fang et.al. 2401.08210v1 link
2024-01-12 Mind Your Format: Towards Consistent Evaluation of In-Context Learning Improvements Anton Voronov et.al. 2401.06766v1 null
2024-01-12 Classification of singularities of cluster algebras of finite type II: coefficients Angélica Benito et.al. 2401.06758v1 null
2024-01-12 Synthetic Data Generation Framework, Dataset, and Efficient Deep Model for Pedestrian Intention Prediction Muhammad Naveed Riaz et.al. 2401.06757v1 null
2024-01-12 Stylometry Analysis of Multi-authored Documents for Authorship and Author Style Change Detection Muhammad Tayyab Zamir et.al. 2401.06752v1 null
2024-01-12 Efficient Parallel Algorithms for Inpainting-Based Representations of 4K Images -- Part II: Spatial and Tonal Data Optimization Niklas Kämper et.al. 2401.06747v1 null
2024-01-12 Efficient Parallel Algorithms for Inpainting-Based Representations of 4K Images -- Part I: Homogeneous Diffusion Inpainting Niklas Kämper et.al. 2401.06744v1 null
2024-01-12 Complexity Classification of Product State Problems for Local Hamiltonians John Kallaugher et.al. 2401.06725v1 null
2024-01-12 Obstacle-Aware Positioning of a Mobile Robotic Platform for 6G Networks Alexandre Costa et.al. 2401.06717v1 null
2024-01-12 Reliability Analysis of Psychological Concept Extraction and Classification in User-penned Text Muskan Garg et.al. 2401.06709v1 null
2024-01-12 On the existence of charged electrostatic black holes in arbitrary topology Martin Reiris et.al. 2401.06702v1 null
2024-01-11 Distilling Vision-Language Models on Millions of Videos Yue Zhao et.al. 2401.06129v1 null
2024-01-11 Dubbing for Everyone: Data-Efficient Visual Dubbing using Neural Rendering Priors Jack Saunders et.al. 2401.06126v1 null
2024-01-11 Gaussian Shadow Casting for Neural Characters Luis Bolanos et.al. 2401.06116v1 null
2024-01-11 A Closer Look at AUROC and AUPRC under Class Imbalance Matthew B. A. McDermott et.al. 2401.06091v1 link
2024-01-12 LEGO:Language Enhanced Multi-modal Grounding Model Zhaowei Li et.al. 2401.06071v2 link
2024-01-11 On the Power of Graph Neural Networks and Feature Augmentation Strategies to Classify Social Networks Walid Guettala et.al. 2401.06048v1 null
2024-01-11 RAVEN: Rethinking Adversarial Video Generation with Efficient Tri-plane Networks Partha Ghosh et.al. 2401.06035v1 null
2024-01-11 Attention to detail: inter-resolution knowledge distillation Rocío del Amor et.al. 2401.06010v1 link
2024-01-11 Sea ice detection using concurrent multispectral and synthetic aperture radar imagery Martin S J Rogers et.al. 2401.06009v1 null
2024-01-11 Boosting Mixed-Initiative Co-Creativity in Game Design: A Tutorial Solange Margarido et.al. 2401.05999v1 null
2024-01-10 Towards Online Sign Language Recognition and Translation Ronglai Zuo et.al. 2401.05336v1 link
2024-01-10 ANIM-400K: A Large-Scale Dataset for Automated End-To-End Dubbing of Video Kevin Cai et.al. 2401.05314v1 link
2024-01-10 Strategic Client Selection to Address Non-IIDness in HAPS-enabled FL Networks Amin Farajzadeh et.al. 2401.05308v1 null
2024-01-10 Frame-like Fourier expansions for finite Borel measures on $\mathbb{R}$ Chad Berner et.al. 2401.05243v1 null
2024-01-10 Learning effective good variables from physical data Giulio Barletta et.al. 2401.05226v1 link
2024-01-10 TOVAC: Tele-operated Vehicle Admission Control and Routing Jorge Martín-Pérez et.al. 2401.05225v1 null
2024-01-10 Do Vision and Language Encoders Represent the World Similarly? Mayug Maniparambil et.al. 2401.05224v1 null
2024-01-10 Exploring Vulnerabilities of No-Reference Image Quality Assessment Models: A Query-Based Black-Box Method Chenxi Yang et.al. 2401.05217v1 null
2024-01-10 Pre-trained Large Language Models for Financial Sentiment Analysis Wei Luo et.al. 2401.05215v1 link
2024-01-10 A Novel Prompt-tuning Method: Incorporating Scenario-specific Concepts into a Verbalizer Yong Ma et.al. 2401.05204v1 null
2024-01-09 A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars Ronglai Zuo et.al. 2401.04730v1 link
2024-01-09 U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation Jun Ma et.al. 2401.04722v1 null
2024-01-09 Helicoidal surfaces of prescribed mean curvature in $\mathbb{R}^3$ Aires Eduardo Menani Barbieri et.al. 2401.04721v1 null
2024-01-09 Low-resource finetuning of foundation models beats state-of-the-art in histopathology Benedikt Roth et.al. 2401.04720v1 null
2024-01-09 Jump Cut Smoothing for Talking Heads Xiaojuan Wang et.al. 2401.04718v1 null
2024-01-09 NIPn CHIPS Blaise Boissonneau et.al. 2401.04697v1 null
2024-01-09 CoordGate: Efficiently Computing Spatially-Varying Convolutions in Convolutional Neural Networks Sunny Howard et.al. 2401.04680v1 null
2024-01-09 Benchmark Analysis of Various Pre-trained Deep Learning Models on ASSIRA Cats and Dogs Dataset Galib Muhammad Shahriar Himel et.al. 2401.04666v1 null
2024-01-09 DepressionEmo: A novel dataset for multilabel classification of depression emotions Abu Bakar Siddiqur Rahman et.al. 2401.04655v1 link
2024-01-09 Hold 'em and Fold 'em: Towards Human-scale, Feedback-Controlled Soft Origami Robots Immanuel Ampomah Mensah et.al. 2401.04650v1 null
2024-01-08 Dr$^2$Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning Chen Zhao et.al. 2401.04105v1 null
2024-01-08 RudolfV: A Foundation Model by Pathologists for Pathologists Jonas Dippel et.al. 2401.04079v1 null
2024-01-08 Variance Reduction in Ratio Metrics for Efficient Online Experiments Shubham Baweja et.al. 2401.04062v1 null
2024-01-08 Bjøntegaard Delta (BD): A Tutorial Overview of the Metric, Evolution, Challenges, and Recommendations Nabajeet Barman et.al. 2401.04039v1 null
2024-01-08 Blocks whose defect groups are Suzuki $2$-groups Charles W. Eaton et.al. 2401.04028v1 null
2024-01-08 IDoFew: Intermediate Training Using Dual-Clustering in Language Models for Few Labels Text Classification Abdullah Alsuhaibani et.al. 2401.04025v1 null
2024-01-08 Efficient Multiscale Multimodal Bottleneck Transformer for Audio-Video Classification Wentao Zhu et.al. 2401.04023v1 null
2024-01-08 Resident space object detection method based on the connection between Fourier spectrum of the video data difference frame and the linear velocity projection V. S. Baranova et.al. 2401.04021v1 null
2024-01-09 Recognizing Blazars Using Radio Morphology from the VLA Sky Survey Zhang-Liang Xie et.al. 2401.04009v2 null
2024-01-08 Calabi-Yau Varieties via Cyclic Covers, and Complex Hyperbolic Structures for their Moduli Spaces Chenglong Yu et.al. 2401.04006v1 null
2024-01-05 Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively Haobo Yuan et.al. 2401.02955v1 link
2024-01-05 The Dark Energy Survey Supernova Program: Cosmological Analysis and Systematic Uncertainties M. Vincenzi et.al. 2401.02945v1 null
2024-01-05 Digital-analog quantum learning on Rydberg atom arrays Jonathan Z. Lu et.al. 2401.02940v1 null
2024-01-05 Mixing Magnetic and Electric Ehlers-Harrison transformations: The Electromagnetic Swirling Spacetime and Novel Type I Backgrounds José Barrientos et.al. 2401.02924v1 null
2024-01-05 Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks Kevin Everson et.al. 2401.02921v1 null
2024-01-05 Analytically-Driven Resource Management for Cloud-Native Microservices Yanqi Zhang et.al. 2401.02920v1 null
2024-01-05 Introducing Bode: A Fine-Tuned Large Language Model for Portuguese Prompt-Based Task Gabriel Lino Garcia et.al. 2401.02909v1 null
2024-01-05 Robust Bichromatic Classification using Two Lines Erwin Glazenburg et.al. 2401.02897v1 null
2024-01-05 Particle-Wise Higher-Order SPH Field Approximation for DVR Jonathan Fischer et.al. 2401.02896v1 null
2024-01-05 Nonlinear functional regression by functional deep neural network with kernel embedding Zhongjie Shi et.al. 2401.02890v1 null
2024-01-04 asimulation: Domain formation and impact on observables in resolved cosmological simulations of the (a)symmetron Øyvind Christiansen et.al. 2401.02410v1 link
2024-01-04 Gravitational waves from dark domain walls Øyvind Christiansen et.al. 2401.02409v1 link
2024-01-05 Correctness Comparison of ChatGPT-4, Bard, Claude-2, and Copilot for Spatial Tasks Hartwig H. Hochmair et.al. 2401.02404v2 null
2024-01-04 3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation Zihao Xiao et.al. 2401.02402v1 null
2024-01-04 Analyzing Misinformation Claims During the 2022 Brazilian General Election on WhatsApp, Twitter, and Kwai Scott A. Hale et.al. 2401.02395v1 null
2024-01-04 Image denoising and model-independent parameterization for improving IVIM MRI Caleb Sample et.al. 2401.02394v1 null
2024-01-04 Survey of 3D Human Body Pose and Shape Estimation Methods for Contemporary Dance Applications Darshan Venkatrayappa et.al. 2401.02383v1 null
2024-01-04 A novel method to enhance pneumonia detection via a model-level ensembling of CNN and vision transformer Sandeep Angara et.al. 2401.02358v1 null
2024-01-04 ClassWise-SAM-Adapter: Parameter Efficient Fine-tuning Adapts Segment Anything to SAR Domain for Semantic Segmentation Xinyang Pu et.al. 2401.02326v1 link
2024-01-04 Reflection physics in X-ray-emitting Symbiotic Stars Jesús A. Toalá et.al. 2401.02318v1 null
2024-01-03 Profinite equivariant spectra and their tensor-triangular geometry Scott Balchin et.al. 2401.01878v1 null
2024-01-03 A spatial mixture model for spaceborne lidar observations over mixed forest and non-forest land types Paul B. May et.al. 2401.01848v1 null
2024-01-03 Teaching with a companion: the case of gravity Iuliia Zhurakovskaia et.al. 2401.01832v1 null
2024-01-03 Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modeling Himmet Toprak Kesgin et.al. 2401.01830v1 null
2024-01-03 Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions David Junhao Zhang et.al. 2401.01827v1 link
2024-01-03 Detours for Navigating Instructional Videos Kumar Ashutosh et.al. 2401.01823v1 null
2024-01-03 SENS3: Multisensory Database of Finger-Surface Interactions and Corresponding Sensations Jagan K. Balasubramanian et.al. 2401.01818v1 null
2024-01-03 Signal Processing in the Retina: Interpretable Graph Classifier to Predict Ganglion Cell Responses Yasaman Parhizkar et.al. 2401.01813v1 null
2024-01-03 Efficient Computation of Confidence Sets Using Classification on Equidistributed Grids Lujie Zhou et.al. 2401.01804v1 null
2024-01-03 An experimental sorting method for improving metagenomic data encoding Diogo Pratas et.al. 2401.01786v1 null
2024-01-02 Street Gaussians for Modeling Dynamic Urban Scenes Yunzhi Yan et.al. 2401.01339v1 null
2024-01-02 Classifying Words with 3-sort Automata Tomasz Jastrząb et.al. 2401.01314v1 null
2024-01-03 A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models S. M Towhidul Islam Tonmoy et.al. 2401.01313v2 null
2024-01-02 Integrating Edges into U-Net Models with Explainable Activation Maps for Brain Tumor Segmentation using MR Images Subin Sahayam et.al. 2401.01303v1 null
2024-01-02 $f$-Divergence Based Classification: Beyond the Use of Cross-Entropy Nicola Novello et.al. 2401.01268v1 link
2024-01-02 VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM Fuchen Long et.al. 2401.01256v1 null
2024-01-02 An operational approach to classifying measurement incompatibility Arun Kumar Das et.al. 2401.01236v1 null
2024-01-03 Distribution Matching for Multi-Task Learning of Classification Tasks: a Large-Scale Study on Faces & Beyond Dimitrios Kollias et.al. 2401.01219v2 null
2024-01-02 FGENet: Fine-Grained Extraction Network for Congested Crowd Counting Hao-Yuan Ma et.al. 2401.01208v1 null
2024-01-02 Whole-examination AI estimation of fetal biometrics from 20-week ultrasound scans Lorenzo Venturini et.al. 2401.01201v1 null
2023-12-29 Computational Tools for Trees in Gauge Theory and Gravity Jacob L. Bourjaily et.al. 2312.17745v1 null
2023-12-29 Multiscale Vision Transformers meet Bipartite Matching for efficient single-stage Action Localization Ioanna Ntinou et.al. 2312.17686v1 null
2023-12-29 Malware Detection in IOT Systems Using Machine Learning Techniques Ali Mehrban et.al. 2312.17683v1 null
2023-12-29 FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis Feng Liang et.al. 2312.17681v1 null
2023-12-29 Grasping, Part Identification, and Pose Refinement in One Shot with a Tactile Gripper Joyce Xin-Yan Lim et.al. 2312.17650v1 null
2023-12-29 MoD2T:Model-Data-Driven Motion-Static Object Tracking Method Yang Feng et.al. 2312.17641v1 null
2023-12-29 A New Explanation of the Mechanism of Hadley Circulation Wei Huang et.al. 2312.17637v1 null
2023-12-29 Towards Faithful Explanations for Text Classification with Robustness Improvement and Explanation Guided Training Dongfang Li et.al. 2312.17591v1 null
2023-12-29 A Tool for the Procedural Generation of Shaders using Interactive Evolutionary Algorithms Elio Sasso et.al. 2312.17587v1 link
2023-12-29 Distribution-based Low-rank Embedding Bardia Yousefi et.al. 2312.17579v1 null
2023-12-28 A Simple LLM Framework for Long-Range Video Question-Answering Ce Zhang et.al. 2312.17235v1 null
2023-12-28 4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency Yuyang Yin et.al. 2312.17225v1 null
2023-12-28 EFHQ: Multi-purpose ExtremePose-Face-HQ dataset Trung Tuan Dao et.al. 2312.17205v1 null
2023-12-28 One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts Ziheng Zhao et.al. 2312.17183v1 null
2023-12-28 Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action Jiasen Lu et.al. 2312.17172v1 null
2023-12-28 Classification of multiplication modules over multiplication rings with finitely many minimal primes Volodymyr Bavula et.al. 2312.17170v1 null
2023-12-28 Securing NextG Systems against Poisoning Attacks on Federated Learning: A Game-Theoretic Solution Yalin E. Sagduyu et.al. 2312.17164v1 null
2023-12-28 Replica Tree-based Federated Learning using Limited Data Ramona Ghilea et.al. 2312.17159v1 null
2023-12-29 ARTrackV2: Prompting Autoregressive Tracker Where to Look and How to Describe Yifan Bai et.al. 2312.17133v2 null
2023-12-28 Grounding-Prompter: Prompting LLM with Multimodal Information for Temporal Sentence Grounding in Long Videos Houlun Chen et.al. 2312.17117v1 null
2023-12-26 Microwave signal processing using an analog quantum reservoir computer Alen Senanian et.al. 2312.16166v1 null
2023-12-26 Large-scale Long-tailed Disease Diagnosis on Radiology Images Qiaoyu Zheng et.al. 2312.16151v1 null
2023-12-27 The Media Bias Taxonomy: A Systematic Literature Review on the Forms and Automated Detection of Media Bias Timo Spinde et.al. 2312.16148v2 link
2023-12-26 The non-Abelian Aharonov-Bohm effect P. A. Horvathy et.al. 2312.16133v1 null
2023-12-26 LangSplat: 3D Language Gaussian Splatting Minghan Qin et.al. 2312.16084v1 null
2023-12-26 AdaNAS: Adaptively Post-processing with Self-supervised Neural Architecture Search for Ensemble Rainfall Forecasts Yingpeng Wen et.al. 2312.16046v1 null
2023-12-26 An extended asymmetric sigmoid with Perceptron (SIGTRON) for imbalanced linear classification Hyenkyun Woo et.al. 2312.16043v1 null
2023-12-26 Multi-scale Progressive Feature Embedding for Accurate NIR-to-RGB Spectral Domain Translation Xingxing Yang et.al. 2312.16040v1 null
2023-12-26 Plug-and-Play Regularization on Magnitude with Deep Priors for 3D Near-Field MIMO Imaging Okyanus Oral et.al. 2312.16024v1 null
2023-12-26 Classification of positive solutions of Hardy-Sobolev equation without the finite volume constraints Lu Chen et.al. 2312.16017v1 null
2023-12-25 Training Convolutional Neural Networks with the Forward-Forward algorithm Riccardo Scodellaro et.al. 2312.14924v2 null
2023-12-22 DRStageNet: Deep Learning for Diabetic Retinopathy Staging from Fundus Images Yevgeniy Men et.al. 2312.14891v1 null
2023-12-22 On rate-optimal classification from non-private and from private data Balázs Csanád Csáji et.al. 2312.14889v1 null
2023-12-22 Classification of cubic tricirculant nut graphs Ivan Damnjanović et.al. 2312.14884v1 null
2023-12-22 Neural-network-based regularization methods for inverse problems in imaging Andreas Habring et.al. 2312.14849v1 null
2023-12-22 Classification of 3-GNDB Graphs Amir Hosseini et.al. 2312.14835v1 null
2023-12-22 Dreaming of Electrical Waves: Generative Modeling of Cardiac Excitation Waves using Diffusion Models Tanish Baranwal et.al. 2312.14830v1 null
2023-12-22 Classification of generalised higher-order Einstein-Maxwell Lagrangians Aimeric Colléaux et.al. 2312.14814v1 null
2023-12-22 On support vector machines under a multiple-cost scenario Sandra Benítez-Peña et.al. 2312.14795v1 null
2023-12-22 The Rate-Distortion-Perception-Classification Tradeoff: Joint Source Coding and Modulation via Inverse-Domain GANs Junli Fang et.al. 2312.14792v1 null
2023-12-21 3D Pose Estimation of Two Interacting Hands from a Monocular Event Camera Christen Millerdurai et.al. 2312.14157v1 null
2023-12-21 Virtual Pets: Animatable Animal Generation in 3D Scenes Yen-Chi Cheng et.al. 2312.14154v1 null
2023-12-21 TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification Qinying Liu et.al. 2312.14149v1 link
2023-12-21 HeadCraft: Modeling High-Detail Shape Variations for Animated 3DMMs Artem Sevastopolsky et.al. 2312.14140v1 null
2023-12-21 Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach Qinying Liu et.al. 2312.14138v1 link
2023-12-21 Diffusion Reward: Learning Rewards via Conditional Video Diffusion Tao Huang et.al. 2312.14134v1 null
2023-12-21 WellFactor: Patient Profiling using Integrative Embedding of Healthcare Data Dongjin Choi et.al. 2312.14129v1 null
2023-12-21 VideoPoet: A Large Language Model for Zero-Shot Video Generation Dan Kondratyuk et.al. 2312.14125v1 null
2023-12-21 LingoQA: Video Question Answering for Autonomous Driving Ana-Maria Marcu et.al. 2312.14115v1 link
2023-12-21 LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding Senqiao Yang et.al. 2312.14074v1 null
2023-12-20 Deep Learning on 3D Neural Fields Pierluigi Zama Ramirez et.al. 2312.13277v1 null
2023-12-20 The 1/4-BPS building blocks of brane interactions Ben Eckardt et.al. 2312.13269v1 null
2023-12-20 ClassLIE: Structure- and Illumination-Adaptive Classification for Low-Light Image Enhancement Zixiang Wei et.al. 2312.13265v1 null
2023-12-20 Putting the p back in Prym Jeff Achter et.al. 2312.13263v1 null
2023-12-20 The role of data embedding in equivariant quantum convolutional neural networks Sreetama Das et.al. 2312.13250v1 null
2023-12-20 Enhancing Neural Training via a Correlated Dynamics Model Jonathan Brokman et.al. 2312.13247v1 null
2023-12-20 SISMIK for brain MRI: Deep-learning-based motion estimation and model-based motion correction in k-space Oscar Dabrowski et.al. 2312.13220v1 null
2023-12-20 Boost recall in QSO selection from highly imbalanced photometric datasets Giorgio Calderone et.al. 2312.13194v1 null
2023-12-20 Ergodic measures for periodic type $\mathbb{Z}^m$-skew-products over Interval Exchange Transformations Yuriy Tumarkin et.al. 2312.13165v1 null
2023-12-20 Underwater Acoustic Signal Recognition Based on Salient Features Minghao Chen et.al. 2312.13143v1 null
2023-12-19 Tracking Any Object Amodally Cheng-Yen Hsieh et.al. 2312.12433v1 null
2023-12-19 The Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment: Official Splits and Benchmark Aditya Murali et.al. 2312.12429v1 null
2023-12-19 Chasing Fairness in Graphs: A GNN Architecture Perspective Zhimeng Jiang et.al. 2312.12369v1 link
2023-12-19 Easy quantum groups Teo Banica et.al. 2312.12368v1 null
2023-12-19 SMC-NCA: Semantic-guided Multi-level Contrast for Semi-supervised Action Segmentation Feixiang Zhou et.al. 2312.12347v1 null
2023-12-19 On the Effectiveness of Retrieval, Alignment, and Replay in Manipulation Norman Di Palo et.al. 2312.12345v1 null
2023-12-19 Full-reference Video Quality Assessment for User Generated Content Transcoding Zihao Qi et.al. 2312.12317v1 null
2023-12-19 First qualitative observations on deep learning vision model YOLO and DETR for automated driving in Austria Stefan Schoder et.al. 2312.12314v1 null
2023-12-19 Holography of New Conformal Higher Spin Gravities in 3d I. Lovrekovic et.al. 2312.12301v1 null
2023-12-19 Prompt-based Domain Discrimination for Multi-source Time Series Domain Adaptation Junxiang Wang et.al. 2312.12276v1 null
2023-12-18 Development and Evaluation of Ensemble Learning-based Environmental Methane Detection and Intensity Prediction Models Reek Majumder et.al. 2312.10879v1 null
2023-12-18 Mimic: Speaking Style Disentanglement for Speech-Driven 3D Facial Animation Hui Fu et.al. 2312.10877v1 null
2023-12-17 Global relaxation-based LP-Newton method for multiple hyperparameter selection in support vector classification with feature selection Qingna Li et.al. 2312.10848v1 null
2023-12-17 Online Boosting Adaptive Learning under Concept Drift for Multistream Classification En Yu et.al. 2312.10841v1 null
2023-12-17 Learning to Act without Actions Dominik Schmidt et.al. 2312.10812v1 null
2023-12-17 Land use/land cover classification of fused Sentinel-1 and Sentinel-2 imageries using ensembles of Random Forests Shivam Pande et.al. 2312.10798v1 null
2023-12-17 Learning to Learn in Interactive Constraint Acquisition Dimos Tsouros et.al. 2312.10795v1 null
2023-12-17 Identification of Knowledge Neurons in Protein Language Models Divya Nori et.al. 2312.10770v1 null
2023-12-17 Multi-Label Classification of COVID-Tweets Using Large Language Models Aniket Deroy et.al. 2312.10748v1 link
2023-12-17 Unmasking Deepfake Faces from Videos Using An Explainable Cost-Sensitive Deep Learning Approach Faysal Mahmud et.al. 2312.10740v1 link
2023-12-15 Understanding Probe Behaviors through Variational Bounds of Mutual Information Kwanghee Choi et.al. 2312.10019v1 link
2023-12-15 Wearable Coaxially-shielded Metamaterial for Magnetic Resonance Imaging Xia Zhu et.al. 2312.10018v1 null
2023-12-15 On the Invertibility of Euler Integral Transforms with Hyperplanes and Quadric Hypersurfaces Mattie Ji et.al. 2312.10002v1 null
2023-12-15 Towards Architecture-Insensitive Untrained Network Priors for Accelerated MRI Reconstruction Yilin Liu et.al. 2312.09988v1 null
2023-12-15 DHFormer: A Vision Transformer-Based Attention Module for Image Dehazing Abdul Wasi et.al. 2312.09955v1 null
2023-12-15 Multi-level graph learning for audio event classification and human-perceived annoyance rating prediction Yuanbo Hou et.al. 2312.09952v1 null
2023-12-15 LogoStyleFool: Vitiating Video Recognition Systems via Logo Style Transfer Yuxin Cao et.al. 2312.09935v1 link
2023-12-15 RDR: the Recap, Deliberate, and Respond Method for Enhanced Language Understanding Yuxin Zi et.al. 2312.09932v1 null
2023-12-15 Reliable Probabilistic Classification with Neural Networks Harris Papadopoulos et.al. 2312.09912v1 null
2023-12-15 TMP: Temporal Motion Propagation for Online Video Super-Resolution Zhengqiang Zhang et.al. 2312.09909v1 null
2023-12-14 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting Zhiyin Qian et.al. 2312.09228v1 null
2023-12-14 Efficient Online Learning of Contact Force Models for Connector Insertion Kevin Tracy et.al. 2312.09190v1 null
2023-12-14 General Object Foundation Model for Images and Videos at Scale Junfeng Wu et.al. 2312.09158v1 null
2023-12-14 Evaluating Augmented Reality Communication: How Can We Teach Procedural Skill in AR? Manuel Rebol et.al. 2312.09152v1 null
2023-12-14 Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting Anthony Chen et.al. 2312.09148v1 null
2023-12-14 Class-Wise Buffer Management for Incremental Object Detection: An Effective Buffer Training Strategy Junsu Kim et.al. 2312.09139v1 null
2023-12-14 Less is more -- the Dispatcher/ Executor principle for multi-task Reinforcement Learning Martin Riedmiller et.al. 2312.09120v1 null
2023-12-14 VideoLCM: Video Latent Consistency Model Xiang Wang et.al. 2312.09109v1 null
2023-12-14 FastInject: Injecting Unpaired Text Data into CTC-based ASR training Keqi Deng et.al. 2312.09100v1 null
2023-12-14 Agent Attention: On the Integration of Softmax and Linear Attention Dongchen Han et.al. 2312.08874v1 link
2023-12-13 VLAP: Efficient Video-Language Alignment via Frame Prompting and Distilling for Video Question Answering Xijun Wang et.al. 2312.08367v1 null
2023-12-13 Challenges and Opportunities in Implementing Negative Differential Resistance Mode Reconfigurable Field Effect Transistors Lephe S et.al. 2312.08351v1 null
2023-12-13 Ehancing CT Image synthesis from multi-modal MRI data based on a multi-task neural network framework Zhuoyao Xin et.al. 2312.08343v1 null
2023-12-13 Preparing VVC for Streaming: A Fast Multi-Rate Encoding Approach Yiqun Liu et.al. 2312.08330v1 null
2023-12-13 Affine monoids of corank one Yulia Zaitseva et.al. 2312.08316v1 null
2023-12-13 VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space Guénolé Fiche et.al. 2312.08291v1 null
2023-12-13 PhenDiff: Revealing Invisible Phenotypes with Conditional Diffusion Models Anis Bourou et.al. 2312.08290v1 link
2023-12-13 On the verification of Embeddings using Hybrid Markov Logic Anup Shakya et.al. 2312.08287v1 null
2023-12-14 High-throughput Biomedical Relation Extraction for Semi-Structured Web Articles Empowered by Large Language Models Songchi Zhou et.al. 2312.08274v2 null
2023-12-13 Efficient Multi-Object Pose Estimation using Multi-Resolution Deformable Attention and Query Aggregation Arul Selvam Periyasamy et.al. 2312.08268v1 null
2023-12-12 diff History for Long-Context Language Agents Ulyana Piterbarg et.al. 2312.07540v1 null
2023-12-12 FreeInit: Bridging Initialization Gap in Video Diffusion Models Tianxing Wu et.al. 2312.07537v1 link
2023-12-12 WHAM: Reconstructing World-grounded Humans with Accurate 3D Motion Soyong Shin et.al. 2312.07531v1 null
2023-12-12 RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation Peng Lu et.al. 2312.07526v1 link
2023-12-12 PEEKABOO: Interactive Video Generation via Masked-Diffusion Yash Jain et.al. 2312.07509v1 null
2023-12-12 NAC-TCN: Temporal Convolutional Networks with Causal Dilated Neighborhood Attention for Emotion Understanding Alexander Mehta et.al. 2312.07507v1 link
2023-12-12 COLMAP-Free 3D Gaussian Splatting Yang Fu et.al. 2312.07504v1 null
2023-12-12 NearbyPatchCL: Leveraging Nearby Patches for Self-Supervised Patch-Level Multi-Class Classification in Whole-Slide Images Gia-Bao Le et.al. 2312.07489v1 null
2023-12-12 MinD-3D: Reconstruct High-quality 3D objects in Human Brain Jianxiong Gao et.al. 2312.07485v1 null
2023-12-12 Classification of retail products: From probabilistic ranking to neural networks Manar Mohamed Hafez et.al. 2312.07482v1 null
2023-12-11 Photorealistic Video Generation with Diffusion Models Agrim Gupta et.al. 2312.06662v1 null
2023-12-11 LightSim: Neural Lighting Simulation for Urban Scenes Ava Pun et.al. 2312.06654v1 null
2023-12-11 Beyond Classification: Definition and Density-based Estimation of Calibration in Object Detection Teodora Popordanoska et.al. 2312.06645v1 null
2023-12-11 Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution Shangchen Zhou et.al. 2312.06640v1 null
2023-12-12 TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation Rongkun Zheng et.al. 2312.06630v2 link
2023-12-11 Neural Text to Articulate Talk: Deep Text to Audiovisual Speech Synthesis achieving both Auditory and Photo-realism Georgios Milis et.al. 2312.06613v1 link
2023-12-11 Early Action Recognition with Action Prototypes Guglielmo Camporese et.al. 2312.06598v1 null
2023-12-11 Flexible visual prompts for in-context learning in computer vision Thomas Foster et.al. 2312.06592v1 link
2023-12-11 QuickQuakeBuildings: Post-earthquake SAR-Optical Dataset for Quick Damaged-building Detection Yao Sun et.al. 2312.06587v1 null
2023-12-12 ESO/HARPS Radial Velocities Catalog Mauro Barbieri et.al. 2312.06586v2 null
2023-12-08 The Long Secondary Period (LSP) Variables: Overview and Some Analysis John R. Percy et.al. 2312.05255v1 null
2023-12-08 Few-Shot Class-Incremental Learning via Training-Free Prototype Calibration Qi-Wei Wang et.al. 2312.05229v1 null
2023-12-08 Shape Matters: Detecting Vertebral Fractures Using Differentiable Point-Based Shape Decoding Hellena Hempe et.al. 2312.05220v1 link
2023-12-08 Enhancing Facial Classification and Recognition using 3D Facial Models and Deep Learning Houting Li et.al. 2312.05219v1 null
2023-12-08 IntrinsicAvatar: Physically Based Inverse Rendering of Dynamic Humans from Monocular Videos via Explicit Ray Tracing Shaofei Wang et.al. 2312.05210v1 null
2023-12-08 Embedding theory in ML toward real-time tracking of structural dynamics through hyperspectral datasets Jonathan D Hollenbach et.al. 2312.05201v1 null
2023-12-08 Video-Based Rendering Techniques: A Survey Rafael Kuffner dos Anjos et.al. 2312.05179v1 null
2023-12-08 Enhancing Single-Frame Supervision for Better Temporal Action Localization Changjian Chen et.al. 2312.05178v1 null
2023-12-08 MRI Scan Synthesis Methods based on Clustering and Pix2Pix Giulia Baldini et.al. 2312.05176v1 null
2023-12-08 TriHuman : A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis Heming Zhu et.al. 2312.05161v1 null
2023-12-07 GenDeF: Learning Generative Deformation Field for Video Generation Wen Wang et.al. 2312.04561v1 null
2023-12-07 MonoGaussianAvatar: Monocular Gaussian Point-based Head Avatar Yufan Chen et.al. 2312.04558v1 null
2023-12-07 GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation Shoufa Chen et.al. 2312.04557v1 null
2023-12-07 SPIDeRS: Structured Polarization for Invisible Depth and Reflectance Sensing Tomoki Ichikawa et.al. 2312.04553v1 null
2023-12-07 PlayFusion: Skill Acquisition via Diffusion from Language-Annotated Play Lili Chen et.al. 2312.04549v1 null
2023-12-07 Multiview Aerial Visual Recognition (MAVREC): Can Multi-view Improve Aerial Visual Perception? Aritra Dutta et.al. 2312.04548v1 null
2023-12-07 Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language Models Ivan Kapelyukh et.al. 2312.04533v1 null
2023-12-07 Camera Height Doesn't Change: Unsupervised Monocular Scale-Aware Road-Scene Depth Estimation Genki Kinoshita et.al. 2312.04530v1 null
2023-12-07 RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models Ozgur Kara et.al. 2312.04524v1 link
2023-12-07 Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation Zhiwu Qing et.al. 2312.04483v1 null
2023-12-06 OneLLM: One Framework to Align All Modalities with Language Jiaming Han et.al. 2312.03700v1 link
2023-12-07 Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers Umberto Cappellazzo et.al. 2312.03694v2 null
2023-12-06 Direct Exoplanet Detection Using Deep Convolutional Image Reconstruction (ConStruct): A New Algorithm for Post-Processing High-Contrast Images Trevor N. Wolf et.al. 2312.03671v1 null
2023-12-06 Annihilating branching Brownian motion Daniel Ahlberg et.al. 2312.03669v1 null
2023-12-06 Towards small and accurate convolutional neural networks for acoustic biodiversity monitoring Serge Zaugg et.al. 2312.03666v1 null
2023-12-06 Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving Ming Nie et.al. 2312.03661v1 link
2023-12-06 Editable Stain Transformation Of Histological Images Using Unpaired GANs Tibor Sloboda et.al. 2312.03647v1 link
2023-12-06 MotionCtrl: A Unified and Flexible Motion Controller for Video Generation Zhouxia Wang et.al. 2312.03641v1 null
2023-12-06 Training Neural Networks on RAW and HDR Images for Restoration Tasks Lei Luo et.al. 2312.03640v1 link
2023-12-07 Evaluation of Active Feature Acquisition Methods for Static Feature Settings Henrik von Kleist et.al. 2312.03619v2 null
2023-12-05 Dexterous Functional Grasping Ananye Agarwal et.al. 2312.02975v1 null
2023-12-05 Describing Differences in Image Sets with Natural Language Lisa Dunlap et.al. 2312.02974v1 link
2023-12-05 GauHuman: Articulated Gaussian Splatting from Monocular Human Videos Shoukang Hu et.al. 2312.02973v1 link
2023-12-05 Detecting algorithmic bias in medical AI-models Jeffrey Smith et.al. 2312.02959v1 null
2023-12-05 Classification for everyone : Building geography agnostic models for fairer recognition Akshat Jindal et.al. 2312.02957v1 null
2023-12-05 Choroidalyzer: An open-source, end-to-end pipeline for choroidal analysis in optical coherence tomography Justin Engelmann et.al. 2312.02956v1 null
2023-12-05 An alternating peak-optimization method for optimal trajectory generation of quadrotor drones Wytze A. B. de Vries et.al. 2312.02944v1 null
2023-12-05 Fast CT anatomic localization algorithm Amit Oved et.al. 2312.02941v1 null
2023-12-05 Drag-A-Video: Non-rigid Video Editing with Point-based Interaction Yao Teng et.al. 2312.02936v1 null
2023-12-06 WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation Jiachen Lu et.al. 2312.02934v2 link
2023-12-04 iMatching: Imperative Correspondence Learning Zitong Zhan et.al. 2312.02141v1 null
2023-12-04 Fast View Synthesis of Casual Videos Yao-Chih Lee et.al. 2312.02135v1 null
2023-12-04 GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians Liangxiao Hu et.al. 2312.02134v1 null
2023-12-04 Hot PATE: Private Aggregation of Distributions for Diverse Task Edith Cohen et.al. 2312.02132v1 null
2023-12-04 Can we truly transfer an actor's genuine happiness to avatars? An investigation into virtual, real, posed and spontaneous faces Vitor Miguel Xavier Peres et.al. 2312.02128v1 null
2023-12-04 Cosmic star-formation history and black hole accretion history inferred from the JWST mid-infrared source counts Seong Jin Kim et.al. 2312.02090v1 null
2023-12-05 VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence Yuchao Gu et.al. 2312.02087v2 null
2023-12-04 Integrating AI into CCTV Systems: A Comprehensive Evaluation of Smart Video Surveillance in Community Space Shanle Yao et.al. 2312.02078v1 null
2023-12-04 GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians Shenhan Qian et.al. 2312.02069v1 null
2023-12-04 TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding Shuhuai Ren et.al. 2312.02051v1 null
2023-12-01 Dense Optical Tracking: Connecting the Dots Guillaume Le Moing et.al. 2312.00786v1 null
2023-12-01 Sequential Modeling Enables Scalable Learning for Large Vision Models Yutong Bai et.al. 2312.00785v1 null
2023-12-01 MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video Hengyi Wang et.al. 2312.00778v1 null
2023-12-01 VideoBooth: Diffusion-based Video Generation with Image Prompts Yuming Jiang et.al. 2312.00777v1 null
2023-12-01 Towards Generalizable Zero-Shot Manipulation via Translating Human Interaction Plans Homanga Bharadhwaj et.al. 2312.00775v1 null
2023-12-01 Explaining Knock-on Effects of Bias Mitigation Svetoslav Nizhnichenkov et.al. 2312.00765v1 null
2023-12-04 Deep Unlearning: Fast and Efficient Training-free Approach to Controlled Forgetting Sangamesh Kodge et.al. 2312.00761v2 null
2023-12-01 Mitigating Over-smoothing in Transformers via Regularized Nonlocal Functionals Tam Nguyen et.al. 2312.00751v1 null
2023-12-01 Tight-minimal dichotomies in Banach spaces Alejandra C. Cáceres-Rigo et.al. 2312.00721v1 null
2023-12-01 GIFT: Generative Interpretable Fine-Tuning Transformers Chinmay Savadikar et.al. 2312.00700v1 link
2023-11-30 Just Add $π$! Pose Induced Video Transformers for Understanding Activities of Daily Living Dominick Reilly et.al. 2311.18840v1 null
2023-11-30 TrafficMOT: A Challenging Dataset for Multi-Object Tracking in Complex Traffic Scenarios Lihao Liu et.al. 2311.18839v1 null
2023-11-30 VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models Zhen Xing et.al. 2311.18837v1 null
2023-11-30 ART$\boldsymbol{\cdot}$V: Auto-Regressive Text-to-Video Generation with Diffusion Models Wenming Weng et.al. 2311.18834v1 null
2023-11-30 MotionEditor: Editing Video Motion via Content-Aware Diffusion Shuyuan Tu et.al. 2311.18830v1 link
2023-11-30 MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation Yanhui Wang et.al. 2311.18829v1 null
2023-11-30 Motion-Conditioned Image Animation for Video Editing Wilson Yan et.al. 2311.18827v1 null
2023-11-30 CAST: Cross-Attention in Space and Time for Video Action Recognition Dongho Lee et.al. 2311.18825v1 link
2023-11-30 Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking Kaifeng Lyu et.al. 2311.18817v1 link
2023-11-30 BIOCLIP: A Vision Foundation Model for the Tree of Life Samuel Stevens et.al. 2311.18803v1 null
2023-11-30 Do text-free diffusion models learn discriminative visual representations? Soumik Mukhopadhyay et.al. 2311.17921v2 null
2023-11-29 Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving Yuqi Wang et.al. 2311.17918v1 link
2023-11-29 HUGS: Human Gaussian Splats Muhammed Kocabas et.al. 2311.17910v1 null
2023-11-29 SODA: Bottleneck Diffusion Models for Representation Learning Drew A. Hudson et.al. 2311.17901v1 null
2023-11-30 Knowledge Pursuit Prompting for Zero-Shot Multimodal Synthesis Jinqi Luo et.al. 2311.17898v2 null
2023-11-29 On the geometry of tensor products over finite fields Stefano Lia et.al. 2311.17896v1 null
2023-11-29 Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation Shuangrui Ding et.al. 2311.17893v1 null
2023-11-29 TSDF-Sampling: Efficient Sampling for Neural Surface Field using Truncated Signed Distance Field Chaerin Min et.al. 2311.17878v1 null
2023-11-29 Enhancing Post-Hoc Explanation Benchmark Reliability for Image Classification Tristan Gomez et.al. 2311.17876v1 null
2023-11-29 On the Adversarial Robustness of Graph Contrastive Learning Methods Filippo Guerranti et.al. 2311.17853v1 null
2023-11-28 Panoptic Video Scene Graph Generation Jingkang Yang et.al. 2311.17058v1 link
2023-11-28 Self-Supervised Motion Magnification by Backpropagating Through Optical Flow Zhaoying Pan et.al. 2311.17056v1 null
2023-11-28 MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training Pavan Kumar Anasosalu Vasu et.al. 2311.17049v1 null
2023-11-28 Jets of foliations and $b^k$-algebroids Francis Bischoff et.al. 2311.17045v1 null
2023-11-28 LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models Yanwei Li et.al. 2311.17043v1 link
2023-11-29 Efficient In-Context Learning in Vision-Language Models for Egocentric Videos Keunwoo Peter Yu et.al. 2311.17041v2 null
2023-11-28 Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer Danah Yatim et.al. 2311.17009v1 null
2023-11-28 MVBench: A Comprehensive Multi-modal Video Understanding Benchmark Kunchang Li et.al. 2311.17005v1 link
2023-11-28 Mirković-Vilonen Polytopes from Combinatorics Mario Sanchez et.al. 2311.16979v1 null
2023-11-28 Natural Language Processing Through Transfer Learning: A Case Study on Sentiment Analysis Aman Yadav et.al. 2311.16965v1 null
2023-11-28 Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models Munan Ning et.al. 2311.16103v2 link
2023-11-27 GART: Gaussian Articulated Template Models Jiahui Lei et.al. 2311.16099v1 null
2023-11-27 On Bringing Robots Home Nur Muhammad Mahi Shafiullah et.al. 2311.16098v1 link
2023-11-27 CG-HOI: Contact-Guided 3D Human-Object Interaction Generation Christian Diller et.al. 2311.16097v1 null
2023-11-27 Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling Zhe Li et.al. 2311.16096v1 link
2023-11-27 Three-dimensional $\mathbb{Z}$ topological insulators without reflection symmetry Alexander C. Tyner et.al. 2311.16092v1 null
2023-11-27 BERT Goes Off-Topic: Investigating the Domain Transfer Challenge using Genre Classification Dmitri Roussinov et.al. 2311.16083v1 link
2023-11-27 ViT-Lens-2: Gateway to Omni-modal Intelligence Weixian Lei et.al. 2311.16081v1 link
2023-11-27 Correlated Spectral and Recurrence Variations of Cygnus X-1 E. M. Broadbent et.al. 2311.16070v1 null
2023-11-27 DiffSLVA: Harnessing Diffusion Models for Sign Language Video Anonymization Zhaoyang Xia et.al. 2311.16060v1 link
2023-11-24 SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation Lingchen Meng et.al. 2311.14671v1 link
2023-11-24 JetLOV: Enhancing Jet Tree Tagging through Neural Network Learning of Optimal LundNet Variables Mauricio A. Diaz et.al. 2311.14654v1 link
2023-11-24 Learning in Deep Factor Graphs with Gaussian Belief Propagation Seth Nabarro et.al. 2311.14649v1 null
2023-11-24 Continuous football player tracking from discrete broadcast data Matthew J. Penn et.al. 2311.14642v1 null
2023-11-24 Emergent Topology in Many-Body Dissipative Quantum Chaos Antonio M. García-García et.al. 2311.14640v1 null
2023-11-24 Unsupervised high-throughput segmentation of cells and cell nuclei in quantitative phase images Julia Sistermanns et.al. 2311.14639v1 null
2023-11-24 ARIA: On the interaction between Architectures, Aggregation methods and Initializations in federated visual classification Vasilis Siomos et.al. 2311.14625v1 null
2023-11-24 Neural Style Transfer for Computer Games Eleftherios Ioannou et.al. 2311.14617v1 null
2023-11-24 Animate124: Animating One Image to 4D Dynamic Scene Yuyang Zhao et.al. 2311.14603v1 null
2023-11-24 A Metalearned Neural Circuit for Nonparametric Bayesian Inference Jake C. Snell et.al. 2311.14601v1 link
2023-11-22 WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space Katja Schwarz et.al. 2311.13570v1 null
2023-11-22 Belted sum decompositions of fully augmented links Porter Morgan et.al. 2311.13540v1 null
2023-11-22 Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression Tam Thuc Do et.al. 2311.13539v1 null
2023-11-22 Leveraging CNNs and Ensemble Learning for Automated Disaster Image Classification Archit Rathod et.al. 2311.13531v1 null
2023-11-22 Applying Dimensionality Reduction as Precursor to LSTM-CNN Models for Classifying Imagery and Motor Signals in ECoG-Based BCIs Soham Bafana et.al. 2311.13507v1 link
2023-11-22 Current Topological and Machine Learning Applications for Bias Detection in Text Colleen Farrelly et.al. 2311.13495v1 null
2023-11-22 Benchmarking Toxic Molecule Classification using Graph Neural Networks and Few Shot Learning Bhavya Mehta et.al. 2311.13490v1 null
2023-11-22 Deep-learning-based acceleration of MRI for radiotherapy planning of pediatric patients with brain tumors Shahinur Alam et.al. 2311.13485v1 link
2023-11-22 Solution discovery via reconfiguration for problems in P Mario Grobler et.al. 2311.13478v1 null
2023-11-22 Experimentation in Early-Stage Video Game Startups: Practices and Challenges Henry Edison et.al. 2311.13462v1 null
2023-11-21 Physics-guided Shape-from-Template: Monocular Video Perception through Neural Surrogate Models David Stotko et.al. 2311.12796v1 null
2023-11-21 Quantifying Impairment and Disease Severity Using AI Models Trained on Healthy Subjects Boyang Yu et.al. 2311.12781v1 link
2023-11-21 Swift Parameter-free Attention Network for Efficient Super-Resolution Cheng Wan et.al. 2311.12770v1 link
2023-11-22 Investigating Weight-Perturbed Deep Neural Networks With Application in Iris Presentation Attack Detection Renu Sharma et.al. 2311.12764v2 link
2023-11-21 High-resolution Image-based Malware Classification using Multiple Instance Learning Tim Peters et.al. 2311.12760v1 link
2023-11-21 SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction Yuanhui Huang et.al. 2311.12754v1 link
2023-11-21 Image Transformation for IoT Time-Series Data: A Review Duygu Altunkaya et.al. 2311.12742v1 null
2023-11-21 Exploring Graph Classification Techniques Under Low Data Constraints: A Comprehensive Study Kush Kothari et.al. 2311.12737v1 null
2023-11-21 Not Just Training, Also Testing: High School Youths' Perspective-Taking through Peer Testing Machine Learning-Powered Applications L. Morales-Navarro et.al. 2311.12733v1 null
2023-11-21 Cascade Learning Localises Discriminant Features in Visual Scene Classification Junwen Wang et.al. 2311.12704v1 null
2023-11-20 Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation Wenhao Li et.al. 2311.12028v1 null
2023-11-20 GPT-4V(ision) for Robotics: Multimodal Task Planning from Human Demonstration Naoki Wake et.al. 2311.12015v1 null
2023-11-20 Evaluating Supervision Levels Trade-Offs for Infrared-Based People Counting David Latortue et.al. 2311.11974v1 null
2023-11-20 SA-Med2D-20M Dataset: Segment Anything in 2D Medical Imaging with 20 Million masks Jin Ye et.al. 2311.11969v1 link
2023-11-20 Correlated Attention in Transformers for Multivariate Time Series Quang Minh Nguyen et.al. 2311.11959v1 null
2023-11-20 Tubular Curvature Filter: Implicit Pointwise Curvature Calculation Method for Tubular Objects Elifnur Sunger et.al. 2311.11931v1 null
2023-11-20 LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions Songhao Han et.al. 2311.11904v1 null
2023-11-20 Multimodal Characterization of Emotion within Multimedia Space Dayo Samuel Banjo et.al. 2311.11892v1 null
2023-11-20 SniffyArt: The Dataset of Smelling Persons Mathias Zinnen et.al. 2311.11888v1 null
2023-11-20 Multi-Task Faces (MTF) Data Set: A Legally and Ethically Compliant Collection of Face Images for Various Classification Tasks Rami Haffar et.al. 2311.11882v1 link
2023-11-17 Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning Rohit Girdhar et.al. 2311.10709v1 null
2023-11-17 SpACNN-LDVAE: Spatial Attention Convolutional Latent Dirichlet Variational Autoencoder for Hyperspectral Pixel Unmixing Soham Chitnis et.al. 2311.10701v1 null
2023-11-17 A note on the convergence of the Bayesian entropy estimator for exchangeable partitions Servet Martinez et.al. 2311.10698v1 null
2023-11-17 Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections Lihan Zha et.al. 2311.10678v1 link
2023-11-17 3D-TexSeg: Unsupervised Segmentation of 3D Texture using Mutual Transformer Learning Iyyakutti Iyappan Ganapathi et.al. 2311.10651v1 null
2023-11-17 User Dynamics-Aware Edge Caching and Computing for Mobile Virtual Reality Mushu Li et.al. 2311.10645v1 null
2023-11-17 Image-Domain Material Decomposition for Dual-energy CT using Unsupervised Learning with Data-fidelity Loss Junbo Peng et.al. 2311.10641v1 null
2023-11-17 Scaling TabPFN: Sketching and Feature Selection for Tabular Prior-Data Fitted Networks Benjamin Feuer et.al. 2311.10609v1 null
2023-11-17 Designing Reconfigurable Intelligent Systems with Markov Blankets Boris Sedlak et.al. 2311.10597v1 null
2023-11-17 FOCAL: A Cost-Aware Video Dataset for Active Learning Kiran Kokilepersaud et.al. 2311.10591v1 link
2023-11-16 Traffic Video Object Detection using Motion Prior Lihao Liu et.al. 2311.10092v1 null
2023-11-16 Moduli space of rank three logarithmic connections on the projective line with three poles Takafumi Matsumoto et.al. 2311.10071v1 null
2023-11-16 Inherently Interpretable Time Series Classification via Multiple Instance Learning Joseph Early et.al. 2311.10049v1 link
2023-11-16 On the potential of Carbon-Enhanced Metal-Poor stars for Galactic Archaeology Aruna Goswami et.al. 2311.10043v1 null
2023-11-16 Match and Locate: low-frequency monocular odometry based on deep feature matching Stepan Konev et.al. 2311.10034v1 null
2023-11-16 Revolutionizing Customer Interactions: Insights and Challenges in Deploying ChatGPT and Generative Chatbots for FAQs Feriel Khennouche et.al. 2311.09976v1 null
2023-11-16 From Pretext to Purpose: Batch-Adaptive Self-Supervised Learning Jiansong Zhang et.al. 2311.09974v1 null
2023-11-16 VertDetect: Fully End-to-End 3D Vertebral Instance Segmentation Model Geoff Klein et.al. 2311.09958v1 null
2023-11-16 Harnessing Transformers: A Leap Forward in Lung Cancer Image Detection Amine Bechar et.al. 2311.09942v1 null
2023-11-17 A Framework for Monitoring and Retraining Language Models in Real-World Applications Jaykumar Kasundra et.al. 2311.09930v2 null
2023-11-15 Single-Image 3D Human Digitization with Shape-Guided Diffusion Badour AlBahar et.al. 2311.09221v1 null
2023-11-15 ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy Kirill Vishniakov et.al. 2311.09215v1 link
2023-11-15 Topology of Pulsar Profiles (ToPP). I. Graph theory method and classification of the EPN D. Vohl et.al. 2311.09201v1 null
2023-11-15 ExpM+NF: Differentially Private Machine Learning that Surpasses DPSGD Robert A. Bridges et.al. 2311.09200v1 null
2023-11-15 Domain Aligned CLIP for Few-shot Classification Muhammad Waleed Gondal et.al. 2311.09191v1 null
2023-11-15 ContraDoc: Understanding Self-Contradictions in Documents with Large Language Models Jierui Li et.al. 2311.09182v1 null
2023-11-15 RBPGAN: Recurrent Back-Projection GAN for Video Super Resolution Dareen Hussein et.al. 2311.09178v1 null
2023-11-15 Model Agnostic Explainable Selective Regression via Uncertainty Estimation Andrea Pugnana et.al. 2311.09145v1 null
2023-11-15 Explainable Text Classification Techniques in Legal Document Review: Locating Rationales without Using Human Annotated Training Text Snippets Christian Mahoney et.al. 2311.09133v1 null
2023-11-15 Cross-view and Cross-pose Completion for 3D Human Understanding Matthieu Armando et.al. 2311.09104v1 null
2023-11-14 MVSA-Net: Multi-View State-Action Recognition for Robust and Deployable Trajectory Generation Ehsan Asali et.al. 2311.08393v1 null
2023-11-14 USLR: an open-source tool for unbiased and smooth longitudinal registration of brain MR Adrià Casamitjana et.al. 2311.08371v1 link
2023-11-14 Inverse Learning with Extremely Sparse Feedback for Recommendation Guanyu Lin et.al. 2311.08302v1 null
2023-11-14 Level Set KSVD Omer Sapir et.al. 2311.08284v1 null
2023-11-14 TENT: Connect Language Models with IoT Sensors for Zero-Shot Activity Recognition Yunjiao Zhou et.al. 2311.08245v1 null
2023-11-14 MCMC to address model misspecification in Deep Learning classification of Radio Galaxies Devina Mohan et.al. 2311.08243v1 null
2023-11-14 Learning Physics-Inspired Regularization for Medical Image Registration with Hypernetworks Anna Reithmeir et.al. 2311.08239v1 link
2023-11-14 Counterfactual Explanation for Regression via Disentanglement in Latent Space Xuan Zhao et.al. 2311.08228v1 null
2023-11-14 Uni-COAL: A Unified Framework for Cross-Modality Synthesis and Super-Resolution of MR Images Zhiyun Song et.al. 2311.08225v1 null
2023-11-14 Eval-GCSC: A New Metric for Evaluating ChatGPT's Performance in Chinese Spelling Correction Kunting Li et.al. 2311.08219v1 link
2023-11-13 GPT-4V(ision) as A Social Media Analysis Engine Hanjia Lyu et.al. 2311.07547v1 link
2023-11-13 mlscorecheck: Testing the consistency of reported performance scores and experiments in machine learning György Kovács et.al. 2311.07541v1 null
2023-11-13 FEMDA: a unified framework for discriminant analysis Pierre Houdouin et.al. 2311.07518v1 null
2023-11-13 Reducing the Need for Backpropagation and Discovering Better Optima With Explicit Optimizations of Neural Networks Jake Ryland Williams et.al. 2311.07498v1 null
2023-11-13 Towards Robotic Tree Manipulation: Leveraging Graph Representations Chung Hee Kim et.al. 2311.07479v1 null
2023-11-13 Temporal Performance Prediction for Deep Convolutional Long Short-Term Memory Networks Laura Fieback et.al. 2311.07477v1 null
2023-11-13 Masked Face Dataset Generation and Masked Face Recognition Rui Cai et.al. 2311.07475v1 link
2023-11-13 A Bayesian Approach to Strong Lens Finding in the Era of Wide-area Surveys Philip Holloway et.al. 2311.07455v1 null
2023-11-13 On the Robustness of Neural Collapse and the Neural Collapse of Robustness Jingtong Su et.al. 2311.07444v1 null
2023-11-13 Optimising Human-AI Collaboration by Learning Convincing Explanations Alex J. Chan et.al. 2311.07426v1 null
2023-11-10 Learning Human Action Recognition Representations Without Real Humans Howard Zhong et.al. 2311.06231v1 link
2023-11-10 Semantic-aware Video Representation for Few-shot Action Recognition Yutao Tang et.al. 2311.06218v1 null
2023-11-10 MultiIoT: Towards Large-scale Multisensory Learning for the Internet of Things Shentong Mo et.al. 2311.06217v1 null
2023-11-10 Deep learning segmentation of fibrous cap in intravascular optical coherence tomography images Juhwan Lee et.al. 2311.06202v1 null
2023-11-10 An Automated Pipeline for Tumour-Infiltrating Lymphocyte Scoring in Breast Cancer Adam J Shephard et.al. 2311.06185v1 link
2023-11-10 Automatic Report Generation for Histopathology images using pre-trained Vision Transformers Saurav Sengupta et.al. 2311.06176v1 null
2023-11-10 Two vertex geometrically irreducible algebras Grzegorz Bobinski et.al. 2311.06173v1 null
2023-11-10 Time Scale Network: A Shallow Neural Network For Time Series Data Trevor Meyer et.al. 2311.06170v1 null
2023-11-10 Deep Fast Vision: A Python Library for Accelerated Deep Transfer Learning Vision Prototyping Fabi Prezja et.al. 2311.06169v1 link
2023-11-10 Going beyond persistent homology using persistent homology Johanna Immonen et.al. 2311.06152v1 null
2023-11-09 FogROS2-Sky: Optimizing Latency and Cost for Multi-Cloud Robot Applications Kaiyuan Chen et.al. 2311.05600v1 null
2023-11-09 A Coefficient Makes SVRG Effective Yida Yin et.al. 2311.05589v1 link
2023-11-09 Outlier-Robust Wasserstein DRO Sloan Nietert et.al. 2311.05573v1 link
2023-11-09 Exploring Emotion Expression Recognition in Older Adults Interacting with a Virtual Coach Cristina Palmero et.al. 2311.05567v1 null
2023-11-09 Disentangling Quantum and Classical Contributions in Hybrid Quantum Machine Learning Architectures Michael Kölle et.al. 2311.05559v1 null
2023-11-09 L-WaveBlock: A Novel Feature Extractor Leveraging Wavelets for Generative Adversarial Networks Mirat Shah et.al. 2311.05548v1 null
2023-11-09 BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis Hao-Bin Duan et.al. 2311.05521v1 null
2023-11-09 Dirichlet Active Learning Kevin Miller et.al. 2311.05501v1 null
2023-11-09 Retinal OCT Synthesis with Denoising Diffusion Probabilistic Models for Layer Segmentation Yuli Wu et.al. 2311.05479v1 null
2023-11-09 Robust Retraining-free GAN Fingerprinting via Personalized Normalization Jianwei Fei et.al. 2311.05478v1 null
2023-11-08 Towards Few-Annotation Learning in Computer Vision: Application to Image Classification and Object Detection tasks Quentin Bouniot et.al. 2311.04888v1 null
2023-11-08 Are foundation models efficient for medical image segmentation? Danielle Ferreira et.al. 2311.04847v1 null
2023-11-08 Bayesian multi-band fitting of alerts for kilonovae detection Biswajit Biswas et.al. 2311.04845v1 null
2023-11-08 Hierarchically Gated Recurrent Neural Network for Sequence Modeling Zhen Qin et.al. 2311.04823v1 link
2023-11-08 A Lightweight Architecture for Real-Time Neuronal-Spike Classification Muhammad Ali Siddiqi et.al. 2311.04808v1 null
2023-11-08 Determination of toxic comments and unintended model bias minimization using Deep learning approach Md Azim Khan et.al. 2311.04789v1 null
2023-11-08 VioLA: Aligning Videos to 2D LiDAR Scans Jun-Jee Chao et.al. 2311.04783v1 null
2023-11-08 FetMRQC: an open-source machine learning framework for multi-centric fetal brain MRI quality control Thomas Sanchez et.al. 2311.04780v1 link
2023-11-08 GCS-ICHNet: Assessment of Intracerebral Hemorrhage Prognosis using Self-Attention with Domain Knowledge Integration Xuhao Shan et.al. 2311.04772v1 link
2023-11-08 An attention-based deep learning network for predicting Platinum resistance in ovarian cancer Haoming Zhuang et.al. 2311.04769v1 null
2023-11-08 Video Instance Matting Jiachen Li et.al. 2311.04212v2 link
2023-11-07 JPAVE: A Generation and Classification-based Model for Joint Product Attribute Prediction and Value Extraction Zhongfen Deng et.al. 2311.04196v1 link
2023-11-07 Linear to circular conversion in the polarized radio emission of a magnetar Marcus E. Lower et.al. 2311.04195v1 null
2023-11-07 SpaDeLeF: A Dataset for Hierarchical Classification of Lexical Functions for Collocations in Spanish Yevhen Kostiuk et.al. 2311.04189v1 null
2023-11-07 A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis Dipanjyoti Paul et.al. 2311.04157v1 link
2023-11-07 Galaxy Spectra neural Network (GaSNet). II. Using Deep Learning for Spectral Classification and Redshift Predictions Fucheng Zhong et.al. 2311.04146v1 null
2023-11-07 I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models Shiwei Zhang et.al. 2311.04145v1 null
2023-11-07 Modelling Sentiment Analysis: LLMs and data augmentation techniques Guillem Senabre Prades et.al. 2311.04139v1 null
2023-11-07 Improved Topological Preservation in 3D Axon Segmentation and Centerline Detection using Geometric Assessment-driven Topological Smoothing (GATS) Nina I. Shamsi et.al. 2311.04116v1 null
2023-11-07 Joint modelling of recurrent and terminal events with discretely-distributed non-parametric frailty: application on re-hospitalizations and death in heart failure patients Chiara Masci et.al. 2311.04103v1 null
2023-11-06 A Classification of Graphs through Quadratic Embedding Constants and Clique Graph Insights Edy Tri Baskoro et.al. 2311.03342v1 null
2023-11-06 Tackling Concept Shift in Text Classification using Entailment-style Modeling Sumegh Roychowdhury et.al. 2311.03320v1 null
2023-11-06 A Foundation Model for Music Informatics Minz Won et.al. 2311.03318v1 link
2023-11-06 FATE: Feature-Agnostic Transformer-based Encoder for learning generalized embedding spaces in flow cytometry data Lisa Weijler et.al. 2311.03314v1 link
2023-11-06 A Single 2D Pose with Context is Worth Hundreds for 3D Human Pose Estimation Qitao Zhao et.al. 2311.03312v1 null
2023-11-06 Advancing Post Hoc Case Based Explanation with Feature Highlighting Eoin Kenny et.al. 2311.03246v1 null
2023-11-06 Machine Learning-Based Tea Leaf Disease Detection: A Comprehensive Review Faruk Ahmed et.al. 2311.03240v1 null
2023-11-06 Out-of-distribution Detection Learning with Unreliable Out-of-distribution Sources Haotian Zheng et.al. 2311.03236v1 null
2023-11-06 Segmentation of Drone Collision Hazards in Airborne RADAR Point Clouds Using PointNet Hector Arroyo et.al. 2311.03221v1 null
2023-11-06 Leveraging Transformers to Improve Breast Cancer Classification and Risk Assessment with Multi-modal and Longitudinal Data Yiqiu Shen et.al. 2311.03217v1 null
2023-11-03 LOTUS: Continual Imitation Learning for Robot Manipulation Through Unsupervised Skill Discovery Weikang Wan et.al. 2311.02058v1 null
2023-11-03 MetaFast: Enabling Fast Metagenomic Classification via Seed Counting and Edit Distance Approximation Arvid E. Gollwitzer et.al. 2311.02029v1 null
2023-11-03 A Structured Pruning Algorithm for Model-based Deep Learning Chicago Park et.al. 2311.02003v1 null
2023-11-03 Detection of keratoconus Diseases using deep Learning AKM Enzam-Ul Haque et.al. 2311.01996v1 null
2023-11-03 Obtaining Explainable Classification Models using Distributionally Robust Optimization Sanjeeb Dash et.al. 2311.01994v1 null
2023-11-03 Leveraging Large-Scale Pretrained Vision Foundation Models for Label-Efficient 3D Point Cloud Segmentation Shichao Dong et.al. 2311.01989v1 null
2023-11-06 RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches Jiayuan Gu et.al. 2311.01977v2 null
2023-11-03 Welded graphs, Wirtinger groups and knotted punctured spheres Benjamin Audoux et.al. 2311.01922v1 null
2023-11-03 Contrast-Agnostic Groupwise Registration by Robust PCA for Quantitative Cardiac MRI Xinqi Li et.al. 2311.01916v1 null
2023-11-03 VQPy: An Object-Oriented Approach to Modern Video Analytics Shan Yu et.al. 2311.01623v1 null
2023-11-02 Tailoring Mixup to Data using Kernel Warping functions Quentin Bouniot et.al. 2311.01434v1 link
2023-11-02 Identifying Alzheimer Disease Dementia Levels Using Machine Learning Methods Md Gulzar Hussain et.al. 2311.01428v1 null
2023-11-02 Exploring Deep Learning Techniques for Glaucoma Detection: A Comprehensive Review Aized Amin Soofi et.al. 2311.01425v1 null
2023-11-02 Holistic Transfer: Towards Non-Disruptive Fine-Tuning with Partial Target Data Cheng-Hao Tu et.al. 2311.01420v1 null
2023-11-02 Learning to See Physical Properties with Active Sensing Motor Policies Gabriel B. Margolis et.al. 2311.01405v1 null
2023-11-02 Sim2Real Bilevel Adaptation for Object Surface Classification using Vision-Based Tactile Sensors Gabriele M. Caddeo et.al. 2311.01380v1 link
2023-11-02 Deep learning based Image Compression for Microscopy Images: An Empirical Study Yu Zhou et.al. 2311.01352v1 null
2023-11-02 Unreading Race: Purging Protected Features from Chest X-ray Embeddings Tobias Weber et.al. 2311.01349v1 null
2023-11-02 Scattering Vision Transformer: Spectral Mixing Matters Badri N. Patro et.al. 2311.01310v1 null
2023-11-02 Hybrid-Fusion Transformer for Multisequence MRI Jihoon Cho et.al. 2311.01308v1 null
2023-11-01 Software Repositories and Machine Learning Research in Cyber Security Mounika Vanamala et.al. 2311.00691v1 null
2023-11-01 What User Behaviors Make the Differences During the Process of Visual Analytics? Shahin Doroudian et.al. 2311.00690v1 null
2023-11-01 Deep Learning-Based Classification of Gamma Photon Interactions in Room-Temperature Semiconductor Radiation Detectors Sandeep K. Chaudhuri et.al. 2311.00682v1 null
2023-11-01 Latent Space Translation via Semantic Alignment Valentino Maiorca et.al. 2311.00664v1 link
2023-11-01 Rediscussion of eclipsing binaries. Paper XV. The B-type supergiant system V1765 Cygni John Southworth et.al. 2311.00655v1 null
2023-11-02 Emergence of Collective Open-Ended Exploration from Decentralized Meta-Reinforcement Learning Richard Bornemann et.al. 2311.00651v2 null
2023-11-01 Understanding the Issues and Causes in WebAssembly Application Development: A Mining-based Study Muhammad Waseem et.al. 2311.00646v1 null
2023-11-01 A Bi-level Framework for Traffic Accident Duration Prediction: Leveraging Weather and Road Condition Data within a Practical Optimum Pipeline Rafat Tabassum Sukonna et.al. 2311.00634v1 null
2023-11-01 Controllable Music Production with Diffusion Models and Guidance Gradients Mark Levy et.al. 2311.00613v1 null
2023-11-01 A Robust Deep Learning Method with Uncertainty Estimation for the Pathological Classification of Renal Cell Carcinoma based on CT Images Ni Yao et.al. 2311.00567v1 null
2023-10-31 Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders Srijan Das et.al. 2310.20704v1 null
2023-10-31 SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction Xinyuan Chen et.al. 2310.20700v1 null
2023-10-31 StairNet: Visual Recognition of Stairs for Human-Robot Locomotion Andrew Garrett Kurbis et.al. 2310.20666v1 null
2023-10-31 Performance Improvement in Multi-class Classification via Automated Hierarchy Generation and Exploitation through Extended LCPN Schemes Celal Alagoz et.al. 2310.20641v1 null
2023-10-31 Deepfake detection by exploiting surface anomalies: the SurFake approach Andrea Ciamarra et.al. 2310.20621v1 null
2023-10-31 Enhanced Synthetic MRI Generation from CT Scans Using CycleGAN with Feature Extraction Saba Nikbakhsh et.al. 2310.20604v1 null
2023-10-31 Finiteness properties for Shimura curves and modified diagonal cycles Congling Qiu et.al. 2310.20600v1 null
2023-10-31 Brain-like Flexible Visual Inference by Harnessing Feedback-Feedforward Alignment Tahereh Toosi et.al. 2310.20599v1 link
2023-10-31 Tracially Complete C-Algebras* José R. Carrión et.al. 2310.20594v1 null
2023-10-31 Strongly Magnetized Tidal Disruption Event Disks via Stream Injection in GRMHD Brandon Curd et.al. 2310.20592v1 null
2023-10-29 Improved Motor Imagery Classification Using Adaptive Spatial Filters Based on Particle Swarm Optimization Algorithm Xiong Xiong et.al. 2310.19202v1 null
2023-10-29 Enhancing Motor Imagery Decoding in Brain Computer Interfaces using Riemann Tangent Space Mapping and Cross Frequency Coupling Xiong Xiong et.al. 2310.19198v1 null
2023-10-29 A Survey on Watching Social Issue Videos among YouTube and TikTok Users Shuo Niu et.al. 2310.19193v1 null
2023-10-29 Subjective Quality Evaluation of Point Clouds Using a Head Mounted Display Joao Prazeres et.al. 2310.19179v1 null
2023-10-29 Robustifying Language Models with Test-Time Adaptation Noah Thomas McDermott et.al. 2310.19177v1 null
2023-10-29 Predicting recovery following stroke: deep learning, multimodal data and feature selection using explainable AI Adam White et.al. 2310.19174v1 null
2023-10-29 BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping Srikumar Sastry et.al. 2310.19168v1 link
2023-10-29 Unified Representation for Non-compositional and Compositional Expressions Ziheng Zeng et.al. 2310.19127v1 null
2023-10-29 Efficient IoT Inference via Context-Awareness Mohammad Mehdi Rastikerdar et.al. 2310.19112v1 null
2023-10-29 Pushdown Layers: Encoding Recursive Structure in Transformer Language Models Shikhar Murty et.al. 2310.19089v1 null
2023-10-27 Addressing GAN Training Instabilities via Tunable Classification Losses Monica Welfert et.al. 2310.18291v1 null
2023-10-27 PlantPlotGAN: A Physics-Informed Generative Adversarial Network for Plant Disease Prediction Felipe A. Lopes et.al. 2310.18268v1 null
2023-10-27 MalFake: A Multimodal Fake News Identification for Malayalam using Recurrent Neural Networks and VGG-16 Adhish S. Sujan et.al. 2310.18263v1 null
2023-10-27 Edge AI-Based Vein Detector for Efficient Venipuncture in the Antecubital Fossa Edwin Salcedo et.al. 2310.18234v1 null
2023-10-27 TBDLNet: a network for classifying multidrug-resistant and drug-sensitive tuberculosis Ziquan Zhu et.al. 2310.18222v1 null
2023-10-27 ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models Benjamin Feuer et.al. 2310.18208v1 link
2023-10-27 Artifact-Robust Graph-Based Learning in Digital Pathology Saba Heidari Gheshlaghi et.al. 2310.18192v1 null
2023-10-27 Globular clusters and bar: captured or not captured? Anton A. Smirnov et.al. 2310.18172v1 null
2023-10-27 Style Description based Text-to-Speech with Conditional Prosodic Layer Normalization based Diffusion GAN Neeraj Kumar et.al. 2310.18169v1 null
2023-10-27 DESiRED -- Dynamic, Enhanced, and Smart iRED: A P4-AQM with Deep Reinforcement Learning and In-band Network Telemetry Leandro C. de Almeida et.al. 2310.18159v1 null
2023-10-26 A Coarse-to-Fine Pseudo-Labeling (C2FPL) Framework for Unsupervised Video Anomaly Detection Anas Al-lahham et.al. 2310.17650v1 null
2023-10-26 torchdistill Meets Hugging Face Libraries for Reproducible, Coding-Free Deep Learning Studies: A Case Study on NLP Yoshitomo Matsubara et.al. 2310.17644v1 link
2023-10-26 Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models Tsun-Hsuan Wang et.al. 2310.17642v1 null
2023-10-26 Skew Products on the Berkovich Projective Line Richard A. P. Birkett et.al. 2310.17628v1 null
2023-10-26 A Survey on Transferability of Adversarial Examples across Deep Neural Networks Jindong Gu et.al. 2310.17626v1 link
2023-10-26 MimicGen: A Data Generation System for Scalable Robot Learning using Human Demonstrations Ajay Mandlekar et.al. 2310.17596v1 null
2023-10-26 Linear $x$-coordinate relations of triples on elliptic curves Jerson Caro et.al. 2310.17592v1 null
2023-10-26 A minimax optimal control approach for robust neural ODEs Cristina Cipriani et.al. 2310.17584v1 null
2023-10-26 BLIS-Net: Classifying and Analyzing Signals on Graphs Charles Xu et.al. 2310.17579v1 null
2023-10-26 Knots bounding non-isotopic ribbon disks Jeffrey Meier et.al. 2310.17564v1 null
2023-10-25 RDBench: ML Benchmark for Relational Databases Zizhao Zhang et.al. 2310.16837v1 link
2023-10-25 TD-MPC2: Scalable, Robust World Models for Continuous Control Nicklas Hansen et.al. 2310.16828v1 null
2023-10-26 Deep machine learning for meteor monitoring: advances with transfer learning and gradient-weighted class activation mapping Eloy Peña-Asensio et.al. 2310.16826v2 null
2023-10-25 Uncovering a new group of T Tauri stars in the Taurus-Auriga molecular complex from Gaia and GALEX data Ana Inés Gómez de Castro et.al. 2310.16820v1 null
2023-10-25 Using Diffusion Models to Generate Synthetic Labelled Data for Medical Image Segmentation Daniel Saragih et.al. 2310.16794v1 null
2023-10-25 Navigating Socio-Emotional Risk through Comfort-Building in a Physics Teaching Community of Practice: A Case Study Maggie Mahmood et.al. 2310.16778v1 null
2023-10-25 IntenDD: A Unified Contrastive Learning Approach for Intent Detection and Discovery Bhavuk Singhal et.al. 2310.16761v1 null
2023-10-25 Interferometric Neural Networks Arun Sehrawat et.al. 2310.16742v1 link
2023-10-25 A No-Reference Quality Assessment Method for Digital Human Head Yingjie Zhou et.al. 2310.16732v1 null
2023-10-25 Spherical Wavefront Near-Field DoA Estimation in THz Automotive Radar Ahmet M. Elbir et.al. 2310.16724v1 null
2023-10-24 From Posterior Sampling to Meaningful Diversity in Image Restoration Noa Cohen et.al. 2310.16047v1 null
2023-10-24 Finetuning Offline World Models in the Real World Yunhai Feng et.al. 2310.16029v1 null
2023-10-24 Human-in-the-Loop Task and Motion Planning for Imitation Learning Ajay Mandlekar et.al. 2310.16014v1 null
2023-10-24 CVPR 2023 Text Guided Video Editing Competition Jay Zhangjie Wu et.al. 2310.16003v1 null
2023-10-24 Vision-Language Pseudo-Labels for Single-Positive Multi-Label Learning Xin Xing et.al. 2310.15985v1 link
2023-10-24 Geometry-Aware Video Quality Assessment for Dynamic Digital Human Zicheng Zhang et.al. 2310.15984v1 null
2023-10-24 Minimax Forward and Backward Learning of Evolving Tasks with Performance Guarantees Verónica Álvarez et.al. 2310.15974v1 link
2023-10-24 Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection Manyuan Zhang et.al. 2310.15955v1 null
2023-10-25 Improving Robustness and Reliability in Medical Image Classification with Latent-Guided Diffusion and Nested-Ensembles Xing Shen et.al. 2310.15952v2 null
2023-10-24 ShARc: Shape and Appearance Recognition for Person Identification In-the-wild Haidong Zhu et.al. 2310.15946v1 null
2023-10-23 FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling Haonan Qiu et.al. 2310.15169v1 null
2023-10-23 Bitrate Ladder Prediction Methods for Adaptive Video Streaming: A Review and Benchmark Ahmed Telili et.al. 2310.15163v1 null
2023-10-23 Linear Representations of Sentiment in Large Language Models Curt Tigges et.al. 2310.15154v1 null
2023-10-23 Unlocking the Transferability of Tokens in Deep Models for Tabular Data Qi-Le Zhou et.al. 2310.15149v1 null
2023-10-23 When Should the FDA Inspect Pharmaceutical Manufacturing Facilities to Better Mitigate Drug Shortages? Daniel Kosmas et.al. 2310.15146v1 null
2023-10-23 Novel-View Acoustic Synthesis from 3D Reconstructed Rooms Byeongjoo Ahn et.al. 2310.15130v1 link
2023-10-23 Open-Ended Instructable Embodied Agents with Memory-Augmented Large Language Models Gabriel Sarch et.al. 2310.15127v1 null
2023-10-23 SpVOS: Efficient Video Object Segmentation with Triple Sparse Convolution Weihao Lin et.al. 2310.15115v1 null
2023-10-23 The Self 2.0: How AI-Enhanced Self-Clones Transform Self-Perception and Improve Presentation Skills Qingxiao Zheng et.al. 2310.15112v1 null
2023-10-23 Matryoshka Diffusion Models Jiatao Gu et.al. 2310.15111v1 null
2023-10-20 Using Human-like Mechanism to Weaken Effect of Pre-training Weight Bias in Face-Recognition Convolutional Neural Network Haojiang Ying et.al. 2310.13674v1 null
2023-10-23 Explainable Depression Symptom Detection in Social Media Eliseo Bao Souto et.al. 2310.13664v2 null
2023-10-20 Arabic Dialect Identification under Scrutiny: Limitations of Single-label Classification Amr Keleg et.al. 2310.13661v1 link
2023-10-20 Optimal Transport for Measures with Noisy Tree Metric Tam Le et.al. 2310.13653v1 null
2023-10-20 Principal $2$-blocks with wreathed defect groups up to splendid Morita equivalence Shigeo Koshitani et.al. 2310.13621v1 null
2023-10-20 Skin Lesion Segmentation Improved by Transformer-based Networks with Inter-scale Dependency Modeling Sania Eskandari et.al. 2310.13604v1 link
2023-10-20 Classification of quantum states of light using random measurements through a multimode fiber Saroch Leedumrongwatthanakun et.al. 2310.13599v1 null
2023-10-20 Longer-range Contextualized Masked Autoencoder Taekyung Kim et.al. 2310.13593v1 null
2023-10-20 POTLoc: Pseudo-Label Oriented Transformer for Point-Supervised Temporal Action Localization Elahe Vahdani et.al. 2310.13585v1 null
2023-10-20 Progressive Dual Priori Network for Generalized Breast Tumor Segmentation Li Wang et.al. 2310.13574v1 null
2023-10-19 Putting the Object Back into Video Object Segmentation Ho Kei Cheng et.al. 2310.12982v1 link
2023-10-19 Variational Inference for SDEs Driven by Fractional Noise Rembert Daems et.al. 2310.12975v1 null
2023-10-19 Frozen Transformers in Language Models Are Effective Visual Encoder Layers Ziqi Pang et.al. 2310.12973v1 link
2023-10-19 Bialgebra structures on flat Lie algebras Amine Bahayou et.al. 2310.12966v1 null
2023-10-19 End-to-End Delay Minimization based on Joint Optimization of DNN Partitioning and Resource Allocation for Cooperative Edge Inference Xinrui Ye et.al. 2310.12937v1 null
2023-10-19 Digital Twin-Enabled Intelligent DDoS Detection Mechanism for Autonomous Core Networks Yagmur Yigit et.al. 2310.12924v1 null
2023-10-19 Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning Juan Rocamonde et.al. 2310.12921v1 null
2023-10-19 Unsupervised Object Localization in the Era of Self-Supervised ViTs: A Survey Oriane Siméoni et.al. 2310.12904v1 link
2023-10-19 A Markovian dynamics for $C. elegans$ behavior across scales Antonio C. Costa et.al. 2310.12883v1 link
2023-10-19 Perceptual Assessment and Optimization of High Dynamic Range Image Rendering Peibei Cao et.al. 2310.12877v1 null
2023-10-18 SHARCS: Efficient Transformers through Routing with Dynamic Width Sub-networks Mohammadreza Salehi et.al. 2310.12126v1 null
2023-10-18 Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture Daniel Y. Fu et.al. 2310.12109v1 null
2023-10-18 HSTR-Net: Reference Based Video Super-resolution for Aerial Surveillance with Dual Cameras H. Umut Suluhan et.al. 2310.12092v1 null
2023-10-18 Chemical Analysis of the Brightest Star of the Cetus II Ultra-Faint Dwarf Galaxy Candidate K. B. Webber et.al. 2310.12090v1 null
2023-10-18 One-Shot Imitation Learning: A Pose Estimation Perspective Pietro Vitiello et.al. 2310.12077v1 null
2023-10-18 Exploring Fairness in Pre-trained Visual Transformer based Natural and GAN Generated Image Detection Systems and Understanding the Impact of Image Compression in Fairness Manjary P. Gangan et.al. 2310.12076v1 null
2023-10-18 Black-Box Training Data Identification in GANs via Detector Networks Lukman Olagoke et.al. 2310.12063v1 null
2023-10-19 Robust Class-Conditional Distribution Alignment for Partial Domain Adaptation Sandipan Choudhuri et.al. 2310.12060v2 null
2023-10-18 Exact and efficient solutions of the LMC Multitask Gaussian Process model Olivier Truffinet et.al. 2310.12032v1 link
2023-10-18 CORE: A Few-Shot Company Relation Classification Dataset for Robust Domain Adaptation Philipp Borchert et.al. 2310.12024v1 link
2023-10-17 DELIFFAS: Deformable Light Fields for Fast Avatar Synthesis Youngjoong Kwon et.al. 2310.11449v1 null
2023-10-18 4K4D: Real-Time 4D View Synthesis at 4K Resolution Zhen Xu et.al. 2310.11448v2 null
2023-10-18 EvalCrafter: Benchmarking and Evaluating Large Video Generation Models Yaofang Liu et.al. 2310.11440v2 null
2023-10-17 Transitive generalized toggle groups containing a cycle Jonathan S. Bloom et.al. 2310.11387v1 null
2023-10-17 DialogueLLM: Context and Emotion Knowledge-Tuned LLaMA Models for Emotion Recognition in Conversations Yazhou Zhang et.al. 2310.11374v1 null
2023-10-17 VECHR: A Dataset for Explainable and Robust Classification of Vulnerability Type in the European Court of Human Rights Shanshan Xu et.al. 2310.11368v1 null
2023-10-17 Lie Group Decompositions for Equivariant Neural Networks Mircea Mironenco et.al. 2310.11366v1 null
2023-10-17 Hybrid quantum-classical graph neural networks for tumor classification in digital pathology Anupama Ray et.al. 2310.11353v1 null
2023-10-17 The effect of stemming and lemmatization on Portuguese fake news text classification Lucca de Freitas Santos et.al. 2310.11344v1 null
2023-10-17 Influencing factors on false positive rates when classifying tumor cell line response to drug treatment Priyanka Vasanthakumari et.al. 2310.11329v1 null
2023-10-16 A Survey on Video Diffusion Models Zhen Xing et.al. 2310.10647v1 link
2023-10-16 Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting Zeyu Yang et.al. 2310.10642v1 link
2023-10-16 Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models Kevin Black et.al. 2310.10639v1 null
2023-10-16 Efficacy of Dual-Encoders for Extreme Multi-Label Classification Nilesh Gupta et.al. 2310.10636v1 null
2023-10-16 Overcoming the Rayleigh limit in extremely low SNR Hyunsoo Choi et.al. 2310.10633v1 null
2023-10-16 Video Language Planning Yilun Du et.al. 2310.10625v1 null
2023-10-16 DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing Jia-Wei Liu et.al. 2310.10624v1 null
2023-10-16 BiLL-VTG: Bridging Large Language Models and Lightweight Visual Tools for Video-based Texts Generation Ji Qi et.al. 2310.10586v1 null
2023-10-16 RefConv: Re-parameterized Refocusing Convolution for Powerful ConvNets Zhicheng Cai et.al. 2310.10563v1 link
2023-10-16 Deep learning applied to EEG data with different montages using spatial attention Dung Truong et.al. 2310.10550v1 null
2023-10-13 An Unbiased Look at Datasets for Visuo-Motor Pre-Training Sudeep Dasari et.al. 2310.09289v1 null
2023-10-13 Disentangled Latent Spaces Facilitate Data-Driven Auxiliary Learning Geri Skenderi et.al. 2310.09278v1 null
2023-10-13 A Hybrid Approach for Depression Classification: Random Forest-ANN Ensemble on Motor Activity Signals Anket Patil et.al. 2310.09277v1 null
2023-10-13 PromptRE: Weakly-Supervised Document-Level Relation Extraction via Prompting-Based Data Programming Chufan Gao et.al. 2310.09265v1 null
2023-10-13 Political claim identification and categorization in a multilingual setting: First experiments Urs Zaberer et.al. 2310.09256v1 null
2023-10-13 It's an Alignment, Not a Trade-off: Revisiting Bias and Variance in Deep Models Lin Chen et.al. 2310.09250v1 null
2023-10-13 A Multifaceted Look at Starlink Performance Nitinder Mohan et.al. 2310.09242v1 null
2023-10-13 Time CNN and Graph Convolution Network for Epileptic Spike Detection in MEG Data Pauline Mouches et.al. 2310.09236v1 null
2023-10-13 Ultrasound Image Segmentation of Thyroid Nodule via Latent Semantic Feature Co-Registration Xuewei Li et.al. 2310.09221v1 null
2023-10-13 PaLI-3 Vision Language Models: Smaller, Faster, Stronger Xi Chen et.al. 2310.09199v1 null
2023-10-12 Octopus: Embodied Vision-Language Programmer from Environmental Feedback Jingkang Yang et.al. 2310.08588v1 link
2023-10-12 Is Generalized Dynamic Novel View Synthesis from Monocular Videos Possible Today? Xiaoming Zhao et.al. 2310.08587v1 null
2023-10-12 Im4D: High-Fidelity and Real-Time Novel View Synthesis for Dynamic Scenes Haotong Lin et.al. 2310.08585v1 null
2023-10-12 Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video Shashanka Venkataramanan et.al. 2310.08584v1 null
2023-10-12 Universal Visual Decomposer: Long-Horizon Manipulation Made Easy Zichen Zhang et.al. 2310.08581v1 null
2023-10-12 Learning to Act from Actionless Videos through Dense Correspondences Po-Chen Ko et.al. 2310.08576v1 null
2023-10-12 Effective isometries of periodic shells Hussein Nassar et.al. 2310.08531v1 null
2023-10-12 LLM-augmented Preference Learning from Natural Language Inwon Kang et.al. 2310.08523v1 null
2023-10-12 Impact of time and note duration tokenizations on deep learning symbolic music modeling Nathan Fradet et.al. 2310.08497v1 link
2023-10-12 GraphextQA: A Benchmark for Evaluating Graph-Enhanced Large Language Models Yuanchun Shen et.al. 2310.08487v1 link
2023-10-11 ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models Yingqing He et.al. 2310.07702v1 link
2023-10-11 ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation Bo Peng et.al. 2310.07697v1 null
2023-10-11 Large-scale photonic computing with nonlinear disordered media Hao Wang et.al. 2310.07690v1 null
2023-10-11 Deep Video Inpainting Guided by Audio-Visual Self-Supervision Kyuyeon Kim et.al. 2310.07663v1 null
2023-10-11 Hypercomplex Multimodal Emotion Recognition from EEG and Peripheral Physiological Signals Eleonora Lopez et.al. 2310.07648v1 null
2023-10-11 Attention-Map Augmentation for Hypercomplex Breast Cancer Classification Eleonora Lopez et.al. 2310.07633v1 null
2023-10-11 Differentiable Euler Characteristic Transforms for Shape Classification Ernst Roell et.al. 2310.07630v1 link
2023-10-11 Time-Resolved Reconstruction of Motion, Force, and Stiffness using Spectro-Dynamic MRI Max H. C. van Riel et.al. 2310.07622v1 null
2023-10-11 Reinforcement Learning-based Knowledge Graph Reasoning for Explainable Fact-checking Gustav Nikopensius et.al. 2310.07613v1 null
2023-10-11 QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking Liangming Pan et.al. 2310.07609v1 link
2023-10-10 Convivial Solipsism as a maximally perspectival interpretation Herve Zwirn et.al. 2310.06815v1 null
2023-10-10 A Supervised Embedding and Clustering Anomaly Detection method for classification of Mobile Network Faults R. Mosayebi et.al. 2310.06779v1 null
2023-10-10 Optical assembly of nanostructures mediated by surface roughness Robert G. Felsted et.al. 2310.06774v1 null
2023-10-10 Uni3D: Exploring Unified 3D Representation at Scale Junsheng Zhou et.al. 2310.06773v1 link
2023-10-10 Improved convergence rates for some kernel random forest algorithms Isidoros Iakovidis et.al. 2310.06760v1 null
2023-10-10 Geographic Location Encoding with Spherical Harmonics and Sinusoidal Representation Networks Marc Rußwurm et.al. 2310.06743v1 link
2023-10-10 Multi-domain improves out-of-distribution and data-limited scenarios for medical image analysis Ece Ozkan et.al. 2310.06737v1 null
2023-10-10 S4Sleep: Elucidating the design space of deep-learning-based sleep stage classification models Tiezhi Wang et.al. 2310.06715v1 link
2023-10-10 Tertiary Lymphoid Structures Generation through Graph-based Diffusion Manuel Madeira et.al. 2310.06661v1 null
2023-10-10 Assessing the Impact of a Supervised Classification Filter on Flow-based Hybrid Network Anomaly Detection Dominik Macko et.al. 2310.06656v1 link
2023-10-09 FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing Yuren Cong et.al. 2310.05922v1 null
2023-10-09 Enumerating Calabi-Yau Manifolds: Placing bounds on the number of diffeomorphism classes in the Kreuzer-Skarke list Aditi Chandra et.al. 2310.05909v1 null
2023-10-09 ViCor: Bridging Visual Understanding and Commonsense Reasoning with Large Language Models Kaiwen Zhou et.al. 2310.05872v1 null
2023-10-10 Fine-grained Audio-Visual Joint Representations for Multimodal Large Language Models Guangzhi Sun et.al. 2310.05863v2 link
2023-10-09 Latent Wander: an Alternative Interface for Interactive and Serendipitous Discovery of Large AV Archives Yuchen Yang et.al. 2310.05835v1 null
2023-10-09 Write What You Want: Applying Text-to-video Retrieval to Audiovisual Archives Yuchen Yang et.al. 2310.05825v1 null
2023-10-09 Dipole-Spread Function Engineering for 6D Super-Resolution Microscopy Tingting Wu et.al. 2310.05810v1 null
2023-10-09 A Simple Open-Loop Baseline for Reinforcement Learning Locomotion Tasks Antonin Raffin et.al. 2310.05808v1 null
2023-10-09 Learning Language-guided Adaptive Hyper-modality Representation for Multimodal Sentiment Analysis Haoyu Zhang et.al. 2310.05804v1 null
2023-10-10 Two-timescale Derivative Free Optimization for Performative Prediction with Markovian Data Haitong Liu et.al. 2310.05792v2 null
2023-10-06 Exploiting Transformer Activation Sparsity with Dynamic Inference Mikołaj Piórczyński et.al. 2310.04361v1 null
2023-10-06 SwimXYZ: A large-scale dataset of synthetic swimming motions and videos Fiche Guénolé et.al. 2310.04360v1 null
2023-10-06 Large-Scale Korean Text Dataset for Classifying Biased Speech in Real-World Online Services Dasol Choi et.al. 2310.04313v1 null
2023-10-06 Convergent ADMM Plug and Play PET Image Reconstruction Florent Sureau et.al. 2310.04299v1 null
2023-10-06 A Plug-and-Play Image Registration Network Junhao Hu et.al. 2310.04297v1 null
2023-10-06 Towards Non-contact 3D Ultrasound for Wrist Imaging Antony Jerald et.al. 2310.04296v1 null
2023-10-06 Spectroscopic variability of massive pre-main-sequence stars in M17 A. R. Derkink et.al. 2310.04287v1 null
2023-10-06 Multi-Industry Simplex : A Probabilistic Extension of GICS Maksim Papenkov et.al. 2310.04280v1 null
2023-10-06 Bringing Quantum Algorithms to Automated Machine Learning: A Systematic Review of AutoML Frameworks Regarding Extensibility for QML Algorithms Dennis Klau et.al. 2310.04238v1 null
2023-10-06 Written and spoken corpus of real and fake social media postings about COVID-19 Ng Bee Chin et.al. 2310.04237v1 null
2023-10-05 The Un-Kidnappable Robot: Acoustic Localization of Sneaking People Mengyu Yang et.al. 2310.03743v1 null
2023-10-05 Agent Instructs Large Language Models to be General Zero-Shot Reasoners Nicholas Crispino et.al. 2310.03710v1 link
2023-10-05 OMG-ATTACK: Self-Supervised On-Manifold Generation of Transferable Evasion Attacks Ofir Bar Tal et.al. 2310.03707v1 null
2023-10-05 Role of Spatial Coherence in Diffractive Optical Neural Networks Matthew J. Filipovich et.al. 2310.03679v1 null
2023-10-05 Certification of Deep Learning Models for Medical Image Segmentation Othmane Laousy et.al. 2310.03664v1 null
2023-10-05 Autoregressive Coefficients based Intelligent Protection of Transmission Lines Connected to Type-3 Wind Farms Pallav Kumar Bera et.al. 2310.03663v1 null
2023-10-05 Robustness-Guided Image Synthesis for Data-Free Quantization Jianhong Bai et.al. 2310.03661v1 null
2023-10-05 Balancing Autonomy and Alignment: A Multi-Dimensional Taxonomy for Autonomous LLM-powered Multi-Agent Architectures Thorsten Händler et.al. 2310.03659v1 null
2023-10-05 Strategic Evaluation: Subjects, Evaluators, and Society Benjamin Laufer et.al. 2310.03655v1 null
2023-10-05 CLEVRER-Humans: Describing Physical and Causal Events the Human Way Jiayuan Mao et.al. 2310.03635v1 null
2023-10-04 SemiReward: A General Reward Model for Semi-supervised Learning Siyuan Li et.al. 2310.03013v1 link
2023-10-04 High-dimensional SGD aligns with emerging outlier eigenspaces Gerard Ben Arous et.al. 2310.03010v1 null
2023-10-05 IBCL: Zero-shot Model Generation for Task Trade-offs in Continual Learning Pengyuan Lu et.al. 2310.02995v2 link
2023-10-04 Multiple Physics Pretraining for Physical Surrogate Models Michael McCabe et.al. 2310.02994v1 null
2023-10-04 UniverSLU: Universal Spoken Language Understanding for Diverse Classification and Sequence Generation Tasks with a Single Network Siddhant Arora et.al. 2310.02973v1 null
2023-10-04 Fully Automatic Segmentation of Gross Target Volume and Organs-at-Risk for Radiotherapy Planning of Nasopharyngeal Carcinoma Mehdi Astaraki et.al. 2310.02972v1 null
2023-10-04 Prompting and Adapter Tuning for Self-supervised Encoder-Decoder Speech Model Kai-Wei Chang et.al. 2310.02971v1 null
2023-10-05 Co-modeling the Sequential and Graphical Routes for Peptide Representation Learning Zihan Liu et.al. 2310.02964v2 link
2023-10-04 CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection Yang Cao et.al. 2310.02960v1 link
2023-10-04 HappyFeat -- An interactive and efficient BCI framework for clinical applications Arthur Desbois et.al. 2310.02948v1 null
2023-10-03 DREAM: Visual Decoding from Reversing Human Visual System Weihao Xia et.al. 2310.02265v1 null
2023-10-03 RSRD: A Road Surface Reconstruction Dataset and Benchmark for Safe and Comfortable Autonomous Driving Tong Zhao et.al. 2310.02262v1 null
2023-10-03 Harnessing Pre-Trained Sentence Transformers for Offensive Language Detection in Indian Languages Ananya Joshi et.al. 2310.02249v1 null
2023-10-04 Tensor Programs VI: Feature Learning in Infinite-Depth Neural Networks Greg Yang et.al. 2310.02244v2 null
2023-10-03 MIS-AVioDD: Modality Invariant and Specific Representation for Audio-Visual Deepfake Detection Vinaya Sree Katamneni et.al. 2310.02234v1 null
2023-10-03 HoloNets: Spectral Convolutions do extend to Directed Graphs Christian Koke et.al. 2310.02232v1 null
2023-10-03 Extraction of Medication and Temporal Relation from Clinical Text by Harnessing Different Deep Learning Models Hangyu Tu et.al. 2310.02229v1 null
2023-10-03 Symmetry-based classification of exact flat bands in single and bilayer moiré systems Siddhartha Sarkar et.al. 2310.02218v1 null
2023-10-03 Learnable Data Augmentation for One-Shot Unsupervised Domain Adaptation Julio Ivan Davila Carrazco et.al. 2310.02201v1 null
2023-10-03 CNN photometric redshifts in the SDSS at $r\leq 20$ M. Treyer et.al. 2310.02173v1 null
2023-09-29 A Large Language Model Approach to Educational Survey Feedback Analysis Michael J. Parker et.al. 2309.17447v1 null
2023-10-02 LLM-grounded Video Diffusion Models Long Lian et.al. 2309.17444v2 null
2023-09-29 Classification of Potholes Based on Surface Area Using Pre-Trained Models of Convolutional Neural Network Chauhdary Fazeel Ahmad et.al. 2309.17426v1 null
2023-09-29 CNN-based automatic segmentation of Lumen & Media boundaries in IVUS images using closed polygonal chains Pavel Sinha et.al. 2309.17406v1 null
2023-09-29 AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition Andrew Rouditchenko et.al. 2309.17395v1 null
2023-09-29 Tree Cross Attention Leo Feng et.al. 2309.17388v1 null
2023-09-29 Adversarial Imitation Learning from Visual Observations using Latent Information Vittorio Giammarino et.al. 2309.17371v1 link
2023-09-29 SpinView: General interactive visual analysis tool for multiscale computational magnetism Qichen Xu et.al. 2309.17367v1 null
2023-09-29 Asynchronous Graph Generators Christopher P. Ley et.al. 2309.17335v1 null
2023-09-29 Multi-Depth Branches Network for Efficient Image Super-Resolution Huiyuan Tian et.al. 2309.17334v1 link
2023-09-29 Demystifying CLIP Data Hu Xu et.al. 2309.16671v2 link
2023-09-28 Decaf: Monocular Deformation Capture for Face and Hand Interactions Soshi Shimada et.al. 2309.16670v1 null
2023-09-28 Training a Large Video Model on a Single Machine in a Day Yue Zhao et.al. 2309.16669v1 link
2023-09-28 Novel Deep Learning Pipeline for Automatic Weapon Detection Haribharathi Sivakumar et.al. 2309.16654v1 null
2023-09-28 ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning Qiao Gu et.al. 2309.16650v1 null
2023-09-29 Mixup Your Own Pairs Yilei Wu et.al. 2309.16633v2 link
2023-09-28 Class Activation Map-based Weakly supervised Hemorrhage Segmentation using Resnet-LSTM in Non-Contrast Computed Tomography images Shreyas H Ramananda et.al. 2309.16627v1 null
2023-09-28 The twisting index in semitoric systems Jaume Alonso et.al. 2309.16614v1 null
2023-09-28 Exploiting Edge Features in Graphs with Fused Network Gromov-Wasserstein Distance Junjie Yang et.al. 2309.16604v1 null
2023-09-28 Can LLMs Effectively Leverage Structural Information for Graph Learning: When and Why Jin Huang et.al. 2309.16595v1 null
2023-09-27 SHACIRA: Scalable HAsh-grid Compression for Implicit Neural Representations Sharath Girish et.al. 2309.15848v1 null
2023-09-27 Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing Brian Yan et.al. 2309.15826v1 null
2023-09-27 Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation David Junhao Zhang et.al. 2309.15818v1 link
2023-09-27 Convolutional Networks with Oriented 1D Kernels Alexandre Kirchmeyer et.al. 2309.15812v1 link
2023-09-27 A Quantum-Classical Hybrid Block-Matching Algorithm in Noisy Environment using Dissimilarity Measure M. Martínez-Felipe et.al. 2309.15792v1 null
2023-09-27 Large Language Model Routing with Benchmark Datasets Tal Shnitzer et.al. 2309.15789v1 null
2023-09-27 One For All: Video Conversation is Feasible Without Video Instruction Tuning Ruyang Liu et.al. 2309.15785v1 null
2023-09-27 Rapid Network Adaptation: Learning to Adapt Neural Networks Using Test-Time Feedback Teresa Yeo et.al. 2309.15762v1 null
2023-09-27 Automated CT Lung Cancer Screening Workflow using 3D Camera Brian Teixeira et.al. 2309.15750v1 null
2023-09-27 Data-Driven Latent Space Representation for Robust Bipedal Locomotion Learning Guillermo A. Castillo et.al. 2309.15740v1 null
2023-09-26 Classification of symmetry-enriched topological quantum spin liquids Weicheng Ye et.al. 2309.15118v1 null
2023-09-26 Doduo: Learning Dense Visual Correspondence from Unsupervised Semantic-Aware Flow Zhenyu Jiang et.al. 2309.15110v1 null
2023-09-27 LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models Yaohui Wang et.al. 2309.15103v2 null
2023-09-26 VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning Han Lin et.al. 2309.15091v1 null
2023-09-26 Video-adverb retrieval with compositional adverb-action embeddings Thomas Hummel et.al. 2309.15086v1 null
2023-09-26 Challenges of building medical image datasets for development of deep learning software in stroke Alessandro Fontanella et.al. 2309.15081v1 null
2023-09-26 On Excess Risk Convergence Rates of Neural Network Classifiers Hyunouk Ko et.al. 2309.15075v1 null
2023-09-26 Language-EXtended Indoor SLAM (LEXIS): A Versatile System for Real-time Visual Scene Understanding Christina Kassab et.al. 2309.15065v1 null
2023-09-26 QUILT: Effective Multi-Class Classification on Quantum Computers Using an Ensemble of Diverse Quantum Classifiers Daniel Silver et.al. 2309.15056v1 null
2023-09-26 Thalamic nuclei segmentation from T$_1$-weighted MRI: unifying and benchmarking state-of-the-art methods with young and old cohorts Brendan Williams et.al. 2309.15053v1 null
2023-09-25 Extreme Parkour with Legged Robots Xuxin Cheng et.al. 2309.14341v1 null
2023-09-25 Chop & Learn: Recognizing and Generating Object-State Compositions Nirat Saini et.al. 2309.14339v1 null
2023-09-25 Human-Assisted Continual Robot Learning with Foundation Models Meenal Parakh et.al. 2309.14321v1 null
2023-09-25 MUTEX: Learning Unified Policies from Multimodal Task Specifications Rutav Shah et.al. 2309.14320v1 null
2023-09-25 DeepMesh: Mesh-based Cardiac Motion Tracking using Deep Learning Qingjie Meng et.al. 2309.14306v1 null
2023-09-25 NAS-NeRF: Generative Neural Architecture Search for Neural Radiance Fields Saeejith Nair et.al. 2309.14293v1 null
2023-09-25 CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free Monika Wysoczańska et.al. 2309.14289v1 null
2023-09-25 Comparison of One- Two- and Three- Dimensional CNN models for Drawing-Test-Based Diagnostics of the Parkinson's Disease Xuechao Wang et.al. 2309.14288v1 null
2023-09-26 Virtual Hyperspectral Images Using Symmetric Autoencoders Archisman Bhattacharjee et.al. 2309.14286v2 null
2023-09-25 OmniEvent: A Comprehensive, Fair, and Easy-to-Use Toolkit for Event Understanding Hao Peng et.al. 2309.14258v1 link
2023-09-22 Robotic Offline RL from Internet Videos via Value-Function Pre-Training Chethan Bhateja et.al. 2309.13041v1 null
2023-09-22 Privacy Assessment on Reconstructed Images: Are Existing Evaluation Metrics Faithful to Human Perception? Xiaoxiao Sun et.al. 2309.13038v1 null
2023-09-22 Encoding optimization for quantum machine learning demonstrated on a superconducting transmon qutrit Shuxiang Cao et.al. 2309.13036v1 null
2023-09-22 Performance Analysis of UNet and Variants for Medical Image Segmentation Walid Ehab et.al. 2309.13013v1 null
2023-09-22 Pursuing Counterfactual Fairness via Sequential Autoencoder Across Domains Yujie Lin et.al. 2309.13005v1 null
2023-09-22 Braid groups, elliptic curves, and resolving the quartic Peter Huxford et.al. 2309.12999v1 null
2023-09-22 License Plate Recognition Based On Multi-Angle View Model Dat Tran-Anh et.al. 2309.12972v1 null
2023-09-22 PI-RADS v2 Compliant Automated Segmentation of Prostate Zones Using co-training Motivated Multi-task Dual-Path CNN Arnab Das et.al. 2309.12970v1 null
2023-09-22 Detect Every Thing with Few Examples Xinyu Zhang et.al. 2309.12969v1 link
2023-09-22 Massive End-to-end Models for Short Search Queries Weiran Wang et.al. 2309.12963v1 null
2023-09-21 ForceSight: Text-Guided Mobile Manipulation with Visual-Force Goals Jeremy A. Collins et.al. 2309.12312v1 null
2023-09-21 LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent Jianing Yang et.al. 2309.12311v1 null
2023-09-21 TalkNCE: Improving Active Speaker Detection with Talk-Aware Contrastive Learning Chaeyoung Jung et.al. 2309.12306v1 null
2023-09-22 PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation Shilin Yan et.al. 2309.12303v2 link
2023-09-21 See to Touch: Learning Tactile Dexterity through Visual Incentives Irmak Guzey et.al. 2309.12300v1 null
2023-09-21 The Broad Impact of Feature Imitation: Neural Enhancements Across Financial, Speech, and Physiological Domains Reza Khanmohammadi et.al. 2309.12279v1 null
2023-09-21 Enabling Quartile-based Estimated-Mean Gradient Aggregation As Baseline for Federated Image Classifications Yusen Wu et.al. 2309.12267v1 null
2023-09-21 Parallelizing non-linear sequential models over the sequence length Yi Heng Lim et.al. 2309.12252v1 null
2023-09-21 Adaptive Input-image Normalization for Solving Mode Collapse Problem in GAN-based X-ray Images Muhammad Muneeb Saad et.al. 2309.12245v1 null
2023-09-21 Model-based Clustering using Non-parametric Hidden Markov Models Elisabeth Gassiat et.al. 2309.12238v1 null
2023-09-20 A Large-scale Dataset for Audio-Language Representation Learning Luoyi Sun et.al. 2309.11500v1 null
2023-09-20 FreeU: Free Lunch in Diffusion U-Net Chenyang Si et.al. 2309.11497v1 null
2023-09-21 Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning Tianbao Xie et.al. 2309.11489v2 null
2023-09-20 First detection of CO$_2$ emission in a Centaur: JWST NIRSpec observations of 39P/Oterma O. Harrington Pinto et.al. 2309.11486v1 null
2023-09-20 Multi-Label Takagi-Sugeno-Kang Fuzzy System Qiongdan Lou et.al. 2309.11469v1 null
2023-09-20 Budget-Aware Pruning: Handling Multiple Domains with Less Parameters Samuel Felipe dos Santos et.al. 2309.11464v1 null
2023-09-20 AudioFool: Fast, Universal and synchronization-free Cross-Domain Attack on Speech Recognition Mohamad Fakih et.al. 2309.11462v1 null
2023-09-20 SkeleTR: Towrads Skeleton-based Action Recognition in the Wild Haodong Duan et.al. 2309.11445v1 null
2023-09-20 A Systematic Review of Few-Shot Learning in Medical Imaging Eva Pachetti et.al. 2309.11433v1 null
2023-09-21 Video Screens for Hearing Research: Transmittance and Reflectance of Professional and Other Fabrics Jan Heeren et.al. 2309.11430v2 null
2023-09-19 Assessing the capacity of a denoising diffusion probabilistic model to reproduce spatial context Rucha Deshpande et.al. 2309.10817v1 null
2023-09-19 Multisource Holography Grace Kuo et.al. 2309.10816v1 null
2023-09-19 Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning Tianhua Zhang et.al. 2309.10814v1 link
2023-09-19 Semantic Text Compression for Classification Emrecan Kutay et.al. 2309.10809v1 null
2023-09-19 Multi-Context Dual Hyper-Prior Neural Image Compression Atefeh Khoshkhahtinat et.al. 2309.10799v1 null
2023-09-19 Multi-spectral Entropy Constrained Neural Compression of Solar Imagery Ali Zafari et.al. 2309.10791v1 null
2023-09-19 Guide Your Agent with Adaptive Multimodal Rewards Changyeon Kim et.al. 2309.10790v1 link
2023-09-19 Physics-Informed Machine Learning for Data Anomaly Detection, Classification, Localization, and Mitigation: A Review, Challenges, and Path Forward Mehdi Jabbari Zideh et.al. 2309.10788v1 null
2023-09-19 AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models Yuan Tseng et.al. 2309.10787v1 link
2023-09-19 Context-Aware Neural Video Compression on Solar Dynamics Observatory Atefeh Khoshkhahtinat et.al. 2309.10784v1 null
2023-09-19 Des-q: a quantum algorithm to construct and efficiently retrain decision trees for regression and binary classification Niraj Kumar et.al. 2309.09976v2 null
2023-09-18 Empirical Study of Mix-based Data Augmentation Methods in Physiological Time Series Data Peikun Guo et.al. 2309.09970v1 null
2023-09-18 vSHARP: variable Splitting Half-quadratic ADMM algorithm for Reconstruction of inverse-Problems George Yiasemis et.al. 2309.09954v1 null
2023-09-18 TransientViT: A novel CNN - Vision Transformer hybrid real/bogus transient classifier for the Kilodegree Automatic Transient Survey Zhuoyang Chen et.al. 2309.09937v1 null
2023-09-18 Algebra of Self-Replication Lawrence S. Moss et.al. 2309.09931v1 null
2023-09-18 Evaluating Adversarial Robustness with Expected Viable Performance Ryan McCoppin et.al. 2309.09928v1 null
2023-09-18 Impact of Augmented reality system on elementary school ESL learners in country side of china: Motivations, achievements, behaviors and cognitive attainment Ijaz Ul Haq et.al. 2309.09894v1 null
2023-09-18 Not Enough Labeled Data? Just Add Semantics: A Data-Efficient Method for Inferring Online Health Texts Joseph Gatto et.al. 2309.09877v1 null
2023-09-18 Domain Generalization with Fourier Transform and Soft Thresholding Hongyi Pan et.al. 2309.09866v1 null
2023-09-18 Unsupervised Open-Vocabulary Object Localization in Videos Ke Fan et.al. 2309.09858v1 null
2023-09-18 Closing the Loop on Runtime Monitors with Fallback-Safe MPC Rohan Sinha et.al. 2309.08603v2 null
2023-09-15 Robust Frame-to-Frame Camera Rotation Estimation in Crowded Scenes Fabien Delattre et.al. 2309.08588v1 null
2023-09-15 Compositional Foundation Models for Hierarchical Planning Anurag Ajay et.al. 2309.08587v1 null
2023-09-15 HINT: Healthy Influential-Noise based Training to Defend against Data Poisoning Attacks Minh-Hao Van et.al. 2309.08549v1 null
2023-09-15 Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal Tokens Minsu Kim et.al. 2309.08531v1 null
2023-09-15 Generalised Probabilistic Diffusion Scale-Spaces Pascal Peter et.al. 2309.08511v1 null
2023-09-15 Deep-learning-powered data analysis in plankton ecology Harshith Bachimanchi et.al. 2309.08500v1 link
2023-09-15 P-ROCKET: Pruning Random Convolution Kernels for Time Series Classification Shaowu Chen et.al. 2309.08499v1 link
2023-09-15 YCB-Ev: Event-vision dataset for 6DoF object pose estimation Pavel Rojtberg et.al. 2309.08482v1 link
2023-09-15 Current and future directions in network biology Marinka Zitnik et.al. 2309.08478v1 null
2023-09-14 Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning Zhiwu Qing et.al. 2309.07911v1 link
2023-09-14 Generative Image Dynamics Zhengqi Li et.al. 2309.07906v1 null
2023-09-14 Ambiguity-Aware In-Context Learning with Large Language Models Lingyu Gao et.al. 2309.07900v1 null
2023-09-14 SMARTFEAT: Efficient Feature Construction through Feature-Level Foundation Model Interactions Yin Lin et.al. 2309.07856v1 null
2023-09-14 Two Timin': Repairing Smart Contracts With A Two-Layered Approach Abhinav Jain et.al. 2309.07841v1 null
2023-09-14 Text Classification of Cancer Clinical Trial Eligibility Criteria Yumeng Yang et.al. 2309.07812v1 null
2023-09-14 What Matters to Enhance Traffic Rule Compliance of Imitation Learning for Automated Driving Hongkuan Zhou et.al. 2309.07808v1 null
2023-09-14 Improving Multimodal Classification of Social Media Posts by Leveraging Image-Text Auxiliary tasks Danae Sánchez Villegas et.al. 2309.07794v1 null
2023-09-14 A Multi-In and Multi-Out Dendritic Neuron Model and its Optimization Yu Ding et.al. 2309.07791v1 null
2023-09-15 Virchow: A Million-Slide Digital Pathology Foundation Model Eugene Vorontsov et.al. 2309.07778v2 null
2023-09-13 Contrastive Deep Encoding Enables Uncertainty-aware Machine-learning-assisted Histopathology Nirhoshan Sivaroopan et.al. 2309.07113v1 null
2023-09-13 Data Augmentation via Subgroup Mixup for Improving Fairness Madeline Navarro et.al. 2309.07110v1 null
2023-09-13 The end sum of surfaces Liam K. Axon et.al. 2309.07101v1 null
2023-09-13 Revisiting the classics: On the evolutionary origin of the "Fe II" and "He/N" spectral classes of novae E. Aydi et.al. 2309.07097v1 null
2023-09-13 RadarLCD: Learnable Radar-based Loop Closure Detection Pipeline Mirko Usuelli et.al. 2309.07094v1 null
2023-09-13 Mitigating Group Bias in Federated Learning for Heterogeneous Devices Khotso Selialia et.al. 2309.07085v1 null
2023-09-13 The Boundaries of Verifiable Accuracy, Robustness, and Generalisation in Deep Learning Alexander Bastounis et.al. 2309.07072v1 null
2023-09-13 Aggregating Long-term Sharp Features via Hybrid Transformers for Video Deblurring Dongwei Ren et.al. 2309.07054v1 link
2023-09-13 Thurston's theorem and the Nielsen-Thurston classification via Teichmüller's theorem James Belk et.al. 2309.06993v1 null
2023-09-13 Neural network-based coronary dominance classification of RCA angiograms Ivan Kruzhilov et.al. 2309.06958v1 null
2023-09-12 Learning Disentangled Avatars with Hybrid 3D Representations Yao Feng et.al. 2309.06441v1 null
2023-09-12 LEAP Hand: Low-Cost, Efficient, and Anthropomorphic Hand for Robot Learning Kenneth Shaw et.al. 2309.06440v1 null
2023-09-12 AGMDT: Virtual Staining of Renal Histology Images with Adjacency-Guided Multi-Domain Transfer Tao Ma et.al. 2309.06421v1 null
2023-09-12 Style2Fab: Functionality-Aware Segmentation for Fabricating Personalized 3D Models with Generative AI Faraz Faruqi et.al. 2309.06379v1 null
2023-09-12 Padding-free Convolution based on Preservation of Differential Characteristics of Kernels Kuangdai Leng et.al. 2309.06370v1 null
2023-09-12 Using Reed-Muller Codes for Classification with Rejection and Recovery Daniel Fentham et.al. 2309.06359v1 link
2023-09-12 Eccentric graph of trees and their Cartesian products Anita Arora et.al. 2309.06338v1 null
2023-09-12 Exploring Flat Minima for Domain Generalization with Large Learning Rates Jian Zhang et.al. 2309.06337v1 null
2023-09-12 Grounded Language Acquisition From Object and Action Imagery James Robert Kubricht et.al. 2309.06335v1 null
2023-09-12 Visualising Game Engine Subsystem Coupling Gabriel C. Ullmann et.al. 2309.06329v1 null
2023-09-11 Diffusion-Guided Reconstruction of Everyday Hand-Object Interaction Clips Yufei Ye et.al. 2309.05663v1 null
2023-09-11 From Capture to Display: A Survey on Volumetric Video Yili Jin et.al. 2309.05658v1 null
2023-09-11 Potentials of Deterministic Radio Propagation Simulation for AI-Enabled Localization and Sensing Albrecht Michler et.al. 2309.05650v1 null
2023-09-11 A Novel Supervised Deep Learning Solution to Detect Distributed Denial of Service (DDoS) attacks on Edge Systems using Convolutional Neural Networks (CNN) Vedanth Ramanathan et.al. 2309.05646v1 null
2023-09-11 Boundary Peeling: Outlier Detection Method Using One-Class Peeling Sheikh Arafat et.al. 2309.05630v1 null
2023-09-11 Temporal Action Localization with Enhanced Instant Discriminability Dingfeng Shi et.al. 2309.05590v1 link
2023-09-11 Anisotropic Diffusion Stencils: From Simple Derivations over Stability Estimates to ResNet Implementations Karl Schrader et.al. 2309.05575v1 null
2023-09-11 On the Meromorphic Integrability of the Critical Systems for Optimal Sums of Eigenvalues Yuzhou Tian et.al. 2309.05568v1 null
2023-09-11 OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data Giuseppe Cartella et.al. 2309.05551v1 link
2023-09-11 Distance-Aware eXplanation Based Learning Misgina Tsighe Hagos et.al. 2309.05548v1 link
2023-09-08 Generalized Cross-domain Multi-label Few-shot Learning for Chest X-rays Aroof Aimen et.al. 2309.04462v1 null
2023-09-08 Generalized Variable Selection Algorithms for Gaussian Process Models by LASSO-like Penalty Zhiyong Hu et.al. 2309.04455v1 null
2023-09-08 Vis-SPLIT: Interactive Hierarchical Modeling for mRNA Expression Classification Braden Roper et.al. 2309.04423v1 null
2023-09-08 Video Task Decathlon: Unifying Image and Video Tasks in Autonomous Driving Thomas E. Huang et.al. 2309.04422v1 null
2023-09-08 Seeing-Eye Quadruped Navigation with Force Responsive Locomotion Control David DeFazio et.al. 2309.04370v1 null
2023-09-08 Active Learning for Classifying 2D Grid-Based Level Completability Mahsa Bazzaz et.al. 2309.04367v1 link
2023-09-08 Sparse Codesigned Communication and Radar Systems Hyeon Seok Rou et.al. 2309.04362v1 null
2023-09-08 Learning from Power Signals: An Automated Approach to Electrical Disturbance Identification Within a Power Transmission System Jonathan D. Boyd et.al. 2309.04361v1 null
2023-09-08 Zero-Shot Robustification of Zero-Shot Models With Foundation Models Dyah Adila et.al. 2309.04344v1 null
2023-09-08 Encoding Multi-Domain Scientific Papers by Ensembling Multiple CLS Tokens Ronald Seoh et.al. 2309.04333v1 link
2023-09-07 A-Eval: A Benchmark for Cross-Dataset Evaluation of Abdominal Multi-Organ Segmentation Ziyan Huang et.al. 2309.03906v1 link
2023-09-07 ImageBind-LLM: Multi-modality Instruction Tuning Jiaming Han et.al. 2309.03905v1 link
2023-09-07 Tracking Anything with Decoupled Video Segmentation Ho Kei Cheng et.al. 2309.03903v1 link
2023-09-07 Learning Continuous Exposure Value Representations for Single-Image HDR Reconstruction Su-Kai Chen et.al. 2309.03900v1 null
2023-09-07 The Making and Breaking of Camouflage Hala Lamdouar et.al. 2309.03899v1 null
2023-09-07 ProPainter: Improving Propagation and Transformer for Video Inpainting Shangchen Zhou et.al. 2309.03897v1 null
2023-09-07 Zero-Shot Audio Captioning via Audibility Guidance Tal Shaharabany et.al. 2309.03884v1 null
2023-09-07 Text-to-feature diffusion for audio-visual few-shot learning Otniel-Bogdan Mercea et.al. 2309.03869v1 null
2023-09-07 Classification of Killing Magnetic Curves In H^3 Özgür Kelekçi et.al. 2309.03859v1 null
2023-09-07 CenTime: Event-Conditional Modelling of Censoring in Survival Analysis Ahmed H. Shahin et.al. 2309.03851v1 link
2023-09-07 Terahertz-Band Direction Finding With Beam-Split and Mutual Coupling Calibration Ahmet M. Elbir et.al. 2309.03195v2 null
2023-09-06 Signatures of Bayesian inference emerge from energy efficient synapses James Malkin et.al. 2309.03194v1 null
2023-09-06 3D Transformer based on deformable patch location for differential diagnosis between Alzheimer's disease and Frontotemporal dementia Huy-Dung Nguyen et.al. 2309.03183v1 null
2023-09-06 PDiscoNet: Semantically consistent part discovery for fine-grained recognition Robert van der Klis et.al. 2309.03173v1 null
2023-09-06 ResFields: Residual Neural Fields for Spatiotemporal Signals Marko Mihajlovic et.al. 2309.03160v1 null
2023-09-06 Normal mode decomposition of atomic motion in solids Jaeyun Moon et.al. 2309.03140v1 null
2023-09-06 Serving Time: Real-Time, Safe Motion Planning and Control for Manipulation of Unsecured Objects Zachary Brei et.al. 2309.03111v1 null
2023-09-06 The Secrets of Non-Blind Poisson Deconvolution Abhiram Gnanasambandam et.al. 2309.03105v1 null
2023-09-06 On the $Σ$-invariants of Artin groups satisfying the $K(π,1)$-conjecture Marcos Escartín Ferrer et.al. 2309.03091v1 null
2023-09-06 Hide and Seek (HaS): A Lightweight Framework for Prompt Privacy Protection Yu Chen et.al. 2309.03057v1 null
2023-09-05 ReliTalk: Relightable Talking Portrait Generation from a Single Video Haonan Qiu et.al. 2309.02434v1 link
2023-09-05 A Likelihood Approach to Incorporating Self-Report Data in HIV Recency Classification Wenlong Yang et.al. 2309.02430v1 null
2023-09-05 Building a Winning Team: Selecting Source Model Ensembles using a Submodular Transferability Estimation Approach Vimal K B et.al. 2309.02429v1 null
2023-09-05 EgoPCA: A New Framework for Egocentric Hand-Object Interaction Understanding Yue Xu et.al. 2309.02423v1 null
2023-09-05 Doppelgangers: Learning to Disambiguate Images of Similar Structures Ruojin Cai et.al. 2309.02420v1 link
2023-09-05 Classification of La3+ and Gd3+ rare earth ions using surface-enhanced Raman scattering Hao Jin et.al. 2309.02409v1 null
2023-09-05 Semantic Communications Based on Adaptive Generative Models and Information Bottleneck S. Barbarossa et.al. 2309.02387v1 null
2023-09-05 On the classification of primitive ideals for complex classical Lie algebras, IV William McGovern et.al. 2309.02363v1 null
2023-09-05 Generating Infinite-Resolution Texture using GANs with Patch-by-Patch Paradigm Alhasan Abdellatif et.al. 2309.02340v1 null
2023-09-05 DEEPBEAS3D: Deep Learning and B-Spline Explicit Active Surfaces Helena Williams et.al. 2309.02335v1 null
2023-09-01 Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following Ziyu Guo et.al. 2309.00615v1 link
2023-09-01 Amyloid-Beta Axial Plane PET Synthesis from Structural MRI: An Image Translation Approach for Screening Alzheimer's Disease Fernando Vega et.al. 2309.00569v1 null
2023-09-01 Powder-Bot: A Modular Autonomous Multi-Robot Workflow for Powder X-Ray Diffraction Amy M. Lunt et.al. 2309.00544v1 null
2023-09-01 A Machine Vision Method for Correction of Eccentric Error: Based on Adaptive Enhancement Algorithm Fanyi Wang et.al. 2309.00514v1 null
2023-09-01 Multi-stage Deep Learning Artifact Reduction for Computed Tomography Jiayang Shi et.al. 2309.00494v1 null
2023-09-01 Geometry-aware Line Graph Transformer Pre-training for Molecular Property Prediction Peizhen Bai et.al. 2309.00483v1 null
2023-09-01 Deep Joint Source-Channel Coding for Adaptive Image Transmission over MIMO Channels Haotian Wu et.al. 2309.00470v1 null
2023-09-01 New metrics for analyzing continual learners Nicolas Michel et.al. 2309.00462v1 null
2023-09-01 The miniJPAS survey quasar selection IV: Classification and redshift estimation with SQUEzE Ignasi Pérez-Ràfols et.al. 2309.00461v1 null
2023-09-01 CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding Étienne Labbé et.al. 2309.00454v1 link
2023-08-31 PointLLM: Empowering Large Language Models to Understand Point Clouds Runsen Xu et.al. 2308.16911v1 link
2023-08-31 StyleInV: A Temporal Style Modulated Inversion Network for Unconditional Video Generation Yuhan Wang et.al. 2308.16909v1 link
2023-08-31 Learning to Taste: A Multimodal Wine Dataset Thoranna Bender et.al. 2308.16900v1 null
2023-08-31 EMDB: The Electromagnetic Database of Global 3D Human Pose and Shape in the Wild Manuel Kaufmann et.al. 2308.16894v1 link
2023-08-31 On the Role of Non-Localities in Fundamental Diagram Estimation Jing Liu et.al. 2308.16878v1 null
2023-08-31 SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation Jiaben Chen et.al. 2308.16876v1 null
2023-08-31 Understanding defects in amorphous silicon with million-atom simulations and machine learning Joe D. Morrow et.al. 2308.16868v1 null
2023-08-31 Self-pruning Graph Neural Network for Predicting Inflammatory Disease Activity in Multiple Sclerosis from Brain MR Images Chinmay Prabhakar et.al. 2308.16863v1 link
2023-08-31 Facing Unknown: Open-World Encrypted Traffic Classification Based on Contrastive Pre-Training Xiang Li et.al. 2308.16861v1 null
2023-08-31 Majorization-Minimization for sparse SVMs Alessandro Benfenati et.al. 2308.16858v1 null
2023-08-30 Fully Non-Linear Neuromorphic Computing with Linear Wave Scattering Clara C. Wanjura et.al. 2308.16181v1 null
2023-08-30 General Purpose Audio Effect Removal Matthew Rice et.al. 2308.16177v1 null
2023-08-30 Algebraic, Topological, and Mereological Foundations of Existential Granules Mani A et.al. 2308.16157v1 null
2023-08-31 MMVP: Motion-Matrix-based Video Prediction Yiqi Zhong et.al. 2308.16154v2 link
2023-08-30 Modality Cycles with Masked Conditional Diffusion for Unsupervised Anomaly Segmentation in MRI Ziyun Liang et.al. 2308.16150v1 null
2023-08-30 Spatial Graph Coarsening: Weather and Weekday Prediction with London's Bike-Sharing Service using GNN Yuta Sato et.al. 2308.16122v1 null
2023-08-30 Learned Image Reasoning Prior Penetrates Deep Unfolding Network for Panchromatic and Multi-Spectral Image Fusion Man Zhou et.al. 2308.16083v1 null
2023-08-30 A Classification of Observation-Driven State-Space Count Models for Panel Data Jae Youn Ahn et.al. 2308.16058v1 null
2023-08-30 Low-Rank Multitask Learning based on Tensorized SVMs and LSSVMs Jiani Liu et.al. 2308.16056v1 null
2023-08-30 Telepresence Lantern -- Designing an Immersive Video-Mediated Communication Device for Older Adults Thomas H. Weisswange et.al. 2308.16052v1 null
2023-08-29 An Adaptive Tangent Feature Perspective of Neural Networks Daniel LeJeune et.al. 2308.15478v1 null
2023-08-29 A General-Purpose Self-Supervised Model for Computational Pathology Richard J. Chen et.al. 2308.15474v1 null
2023-08-29 Learning Modulated Transformation in GANs Ceyuan Yang et.al. 2308.15472v1 null
2023-08-30 Policy composition in reinforcement learning via multi-objective policy optimization Shruti Mishra et.al. 2308.15470v2 null
2023-08-29 Input margins can predict generalization too Coenraad Mouton et.al. 2308.15466v1 null
2023-08-29 A Comparative Study of Loss Functions: Traffic Predictions in Regular and Congestion Scenarios Yangxinyu Xie et.al. 2308.15464v1 link
2023-08-29 Online Overexposed Pixels Hallucination in Videos with Adaptive Reference Frame Selection Yazhou Xing et.al. 2308.15462v1 null
2023-08-29 From SMOTE to Mixup for Deep Imbalanced Classification Wei-Chao Cheng et.al. 2308.15457v1 link
2023-08-29 Pseudo-Boolean Polynomials Approach To Edge Detection And Image Segmentation Tendai Mapungwana Chikake et.al. 2308.15453v1 null
2023-08-29 WrappingNet: Mesh Autoencoder via Deep Sphere Deformation Eric Lei et.al. 2308.15413v1 null
2023-08-28 MagicEdit: High-Fidelity and Temporally Coherent Video Editing Jun Hao Liew et.al. 2308.14749v1 null
2023-08-28 MagicAvatar: Multimodal Avatar Generation and Animation Jianfeng Zhang et.al. 2308.14748v1 null
2023-08-28 CoVR: Learning Composed Video Retrieval from Web Video Captions Lucas Ventura et.al. 2308.14746v1 link
2023-08-28 Total Selfie: Generating Full-Body Selfies Bowei Chen et.al. 2308.14740v1 null
2023-08-28 PanoSwin: a Pano-style Swin Transformer for Panorama Understanding Zhixin Ling et.al. 2308.14726v1 null
2023-08-28 VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation Xudong Wang et.al. 2308.14710v1 link
2023-08-28 Fine-Tuning Llama 2 Large Language Models for Detecting Online Sexual Predatory Chats and Abusive Texts Thanh Thi Nguyen et.al. 2308.14683v1 null
2023-08-28 Video-Based Hand Pose Estimation for Remote Assessment of Bradykinesia in Parkinson's Disease Gabriela T. Acevedo Trebbau et.al. 2308.14679v1 null
2023-08-28 Noncommutative tensor triangular geometry: classification via noetherian spectra James Rowe et.al. 2308.14661v1 null
2023-08-28 Towards Standardized Disturbance Rejection Testing of Legged Robot Locomotion with Linear Impactor: A Preliminary Study, Observations, and Implications Bowen Weng et.al. 2308.14636v1 null
2023-08-25 Unveiling the Role of Message Passing in Dual-Privacy Preservation on GNNs Tianyi Zhao et.al. 2308.13513v1 null
2023-08-25 Joint Modeling of Feature, Correspondence, and a Compressed Memory for Video Object Segmentation Jiaming Zhang et.al. 2308.13505v1 null
2023-08-25 Attending Generalizability in Course of Deep Fake Detection by Exploring Multi-task Learning Pranav Balaji et.al. 2308.13503v1 null
2023-08-25 Eventful Transformers: Leveraging Temporal Redundancy in Vision Transformers Matthew Dutson et.al. 2308.13494v1 link
2023-08-25 Temporal Uncertainty Localization to Enable Human-in-the-loop Analysis of Dynamic Contrast-enhanced Cardiac MRI Datasets Dilek M. Yalcinkaya et.al. 2308.13488v1 null
2023-08-25 QKSAN: A Quantum Kernel Self-Attention Network Ren-Xin Zhao et.al. 2308.13422v1 null
2023-08-25 An investigation into the impact of deep learning model choice on sex and race bias in cardiac MR segmentation Tiarna Lee et.al. 2308.13415v1 null
2023-08-25 Self-Supervised Representation Learning with Cross-Context Learning between Global and Hypercolumn Features Zheng Gao et.al. 2308.13392v1 null
2023-08-25 Direction-aware Video Demoireing with Temporal-guided Bilateral Learning Shuning Xu et.al. 2308.13388v1 null
2023-08-25 On flags of holomorphic foliations associated with singular second-order ordinary differential equations Fernando Lourenço et.al. 2308.13370v1 null
2023-08-24 POCO: 3D Pose and Shape Estimation with Confidence Sai Kumar Dwivedi et.al. 2308.12965v1 null
2023-08-24 Motion-Guided Masking for Spatiotemporal Representation Learning David Fan et.al. 2308.12962v1 null
2023-08-24 Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment Sheng Zhang et.al. 2308.12960v1 link
2023-08-24 Beyond Document Page Classification: Design, Datasets, and Challenges Jordy Van Landeghem et.al. 2308.12896v1 null
2023-08-24 Large Language Models Vote: Prompting for Rare Disease Identification David Oniani et.al. 2308.12890v1 link
2023-08-24 Multi-stage feature decorrelation constraints for improving CNN classification performance Qiuyu Zhu et.al. 2308.12880v1 null
2023-08-24 ToonTalker: Cross-Domain Face Reenactment Yuan Gong et.al. 2308.12866v1 null
2023-08-24 Learned Local Attention Maps for Synthesising Vessel Segmentations Yash Deo et.al. 2308.12861v1 null
2023-08-24 Algebraicity of hypergeometric functions with arbitrary parameters Florian Fürnsinn et.al. 2308.12855v1 null
2023-08-24 $p$-brane Galilean and Carrollian Geometries and Gravities Eric Bergshoeff et.al. 2308.12852v1 null
2023-08-23 Simple is Better and Large is Not Enough: Towards Ensembling of Foundational Language Models Nancy Tyagi et.al. 2308.12272v1 null
2023-08-23 Bugsplainer: Leveraging Code Structures to Explain Software Bugs with Neural Machine Translation Parvez Mahbub et.al. 2308.12267v1 null
2023-08-23 SPPNet: A Single-Point Prompt Network for Nuclei Image Segmentation Qing Xu et.al. 2308.12231v1 link
2023-08-23 Towards Real-Time Analysis of Broadcast Badminton Videos Nitin Nilesh et.al. 2308.12199v1 null
2023-08-23 Sign Language Translation with Iterative Prototype Huijie Yao et.al. 2308.12191v1 null
2023-08-23 Tumor-Centered Patching for Enhanced Medical Image Segmentation Mutyyba Asghar et.al. 2308.12168v1 null
2023-08-23 Constant mean curvature hypersurfaces in Anti-de Sitter space Enrico Trebeschi et.al. 2308.12167v1 null
2023-08-23 NPF-200: A Multi-Modal Eye Fixation Dataset and Method for Non-Photorealistic Videos Ziyu Yang et.al. 2308.12163v1 null
2023-08-23 A Probabilistic Fluctuation based Membership Inference Attack for Generative Models Wenjie Fu et.al. 2308.12143v1 null
2023-08-23 Masking Strategies for Background Bias Removal in Computer Vision Models Ananthu Aniraj et.al. 2308.12127v1 link
2023-08-22 StoryBench: A Multifaceted Benchmark for Continuous Story Visualization Emanuele Bugliarello et.al. 2308.11606v1 link
2023-08-22 Semantic Multi-Resolution Communications Matin Mortaheb et.al. 2308.11604v1 null
2023-08-22 EndoNet: model for automatic calculation of H-score on histological slides Egor Ushakov et.al. 2308.11562v1 null
2023-08-22 Open Set Synthetic Image Source Attribution Shengbang Fang et.al. 2308.11557v1 null
2023-08-22 Multi-event Video-Text Retrieval Gengyuan Zhang et.al. 2308.11551v1 link
2023-08-22 Furnishing Sound Event Detection with Language Model Abilities Hualei Wang et.al. 2308.11530v1 null
2023-08-22 LCCo: Lending CLIP to Co-Segmentation Xin Duan et.al. 2308.11506v1 null
2023-08-23 Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition Qitong Wang et.al. 2308.11489v2 link
2023-08-22 Opening the Vocabulary of Egocentric Actions Dibyadip Chatterjee et.al. 2308.11488v1 null
2023-08-22 Free Lunch for Gait Recognition: A Novel Relation Descriptor Jilong Wang et.al. 2308.11487v1 null
2023-08-21 Structured World Models from Human Videos Russell Mendonca et.al. 2308.10901v1 null
2023-08-21 Unlocking Accuracy and Fairness in Differentially Private Image Classification Leonard Berrada et.al. 2308.10888v1 null
2023-08-21 Evaluating quantum generative models via imbalanced data classification benchmarks Graham R. Enos et.al. 2308.10847v1 null
2023-08-21 Pixel Adaptive Deep Unfolding Transformer for Hyperspectral Image Reconstruction Miaoyu Li et.al. 2308.10820v1 null
2023-08-21 Improving Continuous Sign Language Recognition with Cross-Lingual Signs Fangyun Wei et.al. 2308.10809v1 null
2023-08-21 DynED: Dynamic Ensemble Diversification in Data Stream Classification Soheil Abadifard et.al. 2308.10807v1 link
2023-08-21 MGMAE: Motion Guided Masking for Video Masked Autoencoding Bingkun Huang et.al. 2308.10794v1 null
2023-08-21 Extraction of Text from Optic Nerve Optical Coherence Tomography Reports Iyad Majid et.al. 2308.10790v1 null
2023-08-21 Dense Error Map Estimation for MRI-Ultrasound Registration in Brain Tumor Surgery Using Swin UNETR Soorena Salari et.al. 2308.10784v1 null
2023-08-21 Superfluid weight in the isolated band limit within the generalized random phase approximation Minh Tam et.al. 2308.10780v1 null
2023-08-18 Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization Soumik Mukhopadhyay et.al. 2308.09716v1 link
2023-08-18 Dynamic 3D Gaussians: Tracking by Persistent Dynamic View Synthesis Jonathon Luiten et.al. 2308.09713v1 null
2023-08-18 SimDA: Simple Diffusion Adapter for Efficient Video Generation Zhen Xing et.al. 2308.09710v1 null
2023-08-18 Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud Recognition Xuanyu Yi et.al. 2308.09694v1 null
2023-08-18 A Lightweight Transformer for Faster and Robust EBSD Data Collection Harry Dong et.al. 2308.09693v1 link
2023-08-18 Audiovisual Moments in Time: A Large-Scale Annotated Dataset of Audiovisual Actions Michael Joannou et.al. 2308.09685v1 link
2023-08-18 Quantifying Uncertainties of Contact Classifications in a Human-Robot Collaboration with Parallel Robots Aran Mohammad et.al. 2308.09675v1 null
2023-08-18 Classification of modular data up to rank 11 Siu-Hung Ng et.al. 2308.09670v1 null
2023-08-18 Collision Isolation and Identification Using Proprioceptive Sensing for Parallel Robots to Enable Human-Robot Collaboration Aran Mohammad et.al. 2308.09650v1 null
2023-08-18 Robust Uncertainty Quantification using Conformalised Monte Carlo Prediction Daniel Bethell et.al. 2308.09647v1 link
2023-08-16 MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions Henghui Ding et.al. 2308.08544v1 link
2023-08-16 Deployment and Analysis of Instance Segmentation Algorithm for In-field Grade Estimation of Sweetpotatoes Hoang M. Nguyen et.al. 2308.08534v1 null
2023-08-16 Diagnosing Human-object Interaction Detectors Fangrui Zhu et.al. 2308.08529v1 link
2023-08-17 Exploiting Point-Wise Attention in 6D Object Pose Estimation Based on Bidirectional Prediction Yuhao Yang et.al. 2308.08518v2 null
2023-08-17 Two-and-a-half Order Score-based Model for Solving 3D Ill-posed Inverse Problems Zirong Li et.al. 2308.08511v2 null
2023-08-16 ResBuilder: Automated Learning of Depth with Residual Structures Julian Burghoff et.al. 2308.08504v1 null
2023-08-16 Galactic Archaeology: Tracing the Milky Way's Formation and Evolution through Stellar Populations J. Alfredo Collazos et.al. 2308.08492v1 null
2023-08-16 Label Propagation Techniques for Artifact Detection in Imbalanced Classes using Photoplethysmogram Signals Clara Macabiau et.al. 2308.08480v1 null
2023-08-16 DeDoDe: Detect, Don't Describe -- Describe, Don't Detect for Local Feature Matching Johan Edstedt et.al. 2308.08479v1 link
2023-08-16 Classification Committee for Active Deep Object Detection Lei Zhao et.al. 2308.08476v1 null
2023-08-15 CoDeF: Content Deformation Fields for Temporally Consistent Video Processing Hao Ouyang et.al. 2308.07926v1 link
2023-08-15 Helping Hands: An Object-Aware Ego-Centric Video Recognition Model Chuhan Zhang et.al. 2308.07918v1 link
2023-08-15 Relightable and Animatable Neural Avatar from Sparse-View Video Zhen Xu et.al. 2308.07903v1 null
2023-08-15 Back to Basics: A Sanity Check on Modern Time Series Classification Algorithms Bhaskar Dhariyal et.al. 2308.07886v1 link
2023-08-15 The Challenge of Fetal Cardiac MRI Reconstruction Using Deep Learning Denis Prokopenko et.al. 2308.07885v1 null
2023-08-15 Towards Temporal Edge Regression: A Case Study on Agriculture Trade Between Nations Lekang Jiang et.al. 2308.07883v1 link
2023-08-15 Synthesizing Political Zero-Shot Relation Classification via Codebook Knowledge, NLI, and ChatGPT Yibo Hu et.al. 2308.07876v1 null
2023-08-15 SEDA: Self-Ensembling ViT with Defensive Distillation and Adversarial Training for robust Chest X-rays Classification Raza Imam et.al. 2308.07874v1 link
2023-08-15 Sequence Processing with Quantum Tensor Networks Carys Harvey et.al. 2308.07865v1 null
2023-08-15 ImbSAM: A Closer Look at Sharpness-Aware Minimization in Class-Imbalanced Recognition Yixuan Zhou et.al. 2308.07815v1 link
2023-08-14 Comparison between parameter-efficient techniques and full fine-tuning: A case study on multilingual news article classification Olesya Razuvayevskaya et.al. 2308.07282v1 null
2023-08-14 A Robust Approach Towards Distinguishing Natural and Computer Generated Images using Multi-Colorspace fused and Enriched Vision Transformer Manjary P Gangan et.al. 2308.07279v1 null
2023-08-14 EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language Models Peng Wang et.al. 2308.07269v1 link
2023-08-14 Diving with Penguins: Detecting Penguins and their Prey in Animal-borne Underwater Videos via Deep Learning Kejia Zhang et.al. 2308.07267v1 null
2023-08-14 Large-kernel Attention for Efficient and Robust Brain Lesion Segmentation Liam Chalcroft et.al. 2308.07251v1 link
2023-08-14 LCE -- An Augmented Combination of Bagging and Boosting in Python Kevin Fauvel et.al. 2308.07250v1 link
2023-08-14 Large-scale environment mapping and immersive human-robot interaction for agricultural mobile robot teleoperation Tao Liu et.al. 2308.07231v1 null
2023-08-14 Almost fine gradings on algebras and classification of gradings up to isomorphism Alberto Elduque et.al. 2308.07230v1 null
2023-08-14 Distance Matters For Improving Performance Estimation Under Covariate Shift Mélanie Roschewitz et.al. 2308.07223v1 link
2023-08-15 AudioFormer: Audio Transformer learns audio feature representations from discrete acoustic codes Zhaohui Li et.al. 2308.07221v2 link
2023-08-11 ARGUS: Visualization of AI-Assisted Task Guidance in AR Sonia Castelo et.al. 2308.06246v1 null
2023-08-11 Exploring Predicate Visual Context in Detecting of Human-Object Interactions Frederic Z. Zhang et.al. 2308.06202v1 link
2023-08-11 Weakly Supervised Text Classification on Free Text Comments in Patient-Reported Outcome Measures Anna-Grace Linton et.al. 2308.06199v1 null
2023-08-11 Physical Adversarial Attacks For Camera-based Smart Systems: Current Trends, Categorization, Applications, Research Challenges, and Future Outlook Amira Guesmi et.al. 2308.06173v1 null
2023-08-11 Extrinsic geometry and linear differential equations of $\mathfrak{sl}_3$-type Boris Doubrov et.al. 2308.06169v1 null
2023-08-11 Rethinking the Localization in Weakly Supervised Object Localization Rui Xu et.al. 2308.06161v1 null
2023-08-11 Identification of the Relevance of Comments in Codes Using Bag of Words and Transformer Based Models Sruthi S et.al. 2308.06144v1 link
2023-08-11 Lip2Vec: Efficient and Robust Visual Speech Recognition via Latent-to-Latent Visual to Audio Representation Mapping Yasser Abdelaziz Dahou Djilali et.al. 2308.06112v1 null
2023-08-11 Diffusion-based Visual Counterfactual Explanations -- Towards Systematic Quantitative Evaluation Philipp Vaeth et.al. 2308.06100v1 link
2023-08-11 Automated Construction of Time-Space Diagrams for Traffic Analysis Using Street-View Video Sequence Tanay Rastogi et.al. 2308.06098v1 null
2023-08-10 Follow Anything: Open-set detection, tracking, and following in real-time Alaa Maalouf et.al. 2308.05737v1 link
2023-08-10 FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models Guangkai Xu et.al. 2308.05733v1 null
2023-08-10 Optimizing Performance of Feedforward and Convolutional Neural Networks through Dynamic Activation Functions Chinmay Rane et.al. 2308.05724v1 null
2023-08-10 Towards the Automorphism Conjecture I: Combinatorial Control and Compensation for Factorials Bernd S. W. Schröder et.al. 2308.05715v1 null
2023-08-10 Automatic Extraction of Relevant Road Infrastructure using Connected vehicle data and Deep Learning Model Adu-Gyamfi Kojo et.al. 2308.05658v1 null
2023-08-10 Attention-based 3D CNN with Multi-layer Features for Alzheimer's Disease Diagnosis using Brain Images Yanteng Zhang et.al. 2308.05655v1 null
2023-08-10 Counterfactual Cross-modality Reasoning for Weakly Supervised Video Moment Localization Zezhong Lv et.al. 2308.05648v1 link
2023-08-10 Self-Supervised Monocular Depth Estimation by Direction-aware Cumulative Convolution Network Wencheng Han et.al. 2308.05605v1 link
2023-08-10 Object Goal Navigation with Recursive Implicit Maps Shizhe Chen et.al. 2308.05602v1 null
2023-08-10 You Only Prompt Once: On the Capabilities of Prompt Learning on Large Language Models to Tackle Toxic Content Xinlei He et.al. 2308.05596v1 null
2023-08-09 Improved Multi-Shot Diffusion-Weighted MRI with Zero-Shot Self-Supervised Learning Reconstruction Jaejin Cho et.al. 2308.05103v1 link
2023-08-09 DOST -- Domain Obedient Self-supervised Training for Multi Label Classification with Noisy Labels Soumadeep Saha et.al. 2308.05101v1 null
2023-08-09 Constructing Holistic Spatio-Temporal Scene Graph for Video Semantic Role Labeling Yu Zhao et.al. 2308.05081v1 null
2023-08-10 Geometric Learning-Based Transformer Network for Estimation of Segmentation Errors Sneha Sree C et.al. 2308.05068v2 null
2023-08-09 PAT: Position-Aware Transformer for Dense Multi-Label Action Detection Faegheh Sardari et.al. 2308.05051v1 null
2023-08-09 Collaborative Wideband Spectrum Sensing and Scheduling for Networked UAVs in UTM Systems Sravan Reddy Chintareddy et.al. 2308.05036v1 null
2023-08-09 Expert load matters: operating networks at high accuracy and low manual effort Sara Sangalli et.al. 2308.05035v1 null
2023-08-09 MetRoBERTa: Leveraging Traditional Customer Relationship Management Data to Develop a Transit-Topic-Aware Language Model Michael Leong et.al. 2308.05012v1 null
2023-08-09 Exploring Multilingual Text Data Distillation Shivam Sahni et.al. 2308.04982v1 link
2023-08-09 CasCIFF: A Cross-Domain Information Fusion Framework Tailored for Cascade Prediction in Social Networks Hongjun Zhu et.al. 2308.04961v1 null
2023-08-08 A Deep-Learning Method Using Auto-encoder and Generative Adversarial Network for Anomaly Detection on Ancient Stone Stele Surfaces Yikun Liu et.al. 2308.04426v1 null
2023-08-08 A Bi-directional Multi-hop Inference Model for Joint Dialog Sentiment Classification and Act Recognition Li Zheng et.al. 2308.04424v1 null
2023-08-08 DiffCR: A Fast Conditional Diffusion Framework for Cloud Removal from Optical Satellite Images Xuechao Zou et.al. 2308.04417v1 null
2023-08-08 Probabilistic Invariant Learning with Randomized Linear Classifiers Leonardo Cotta et.al. 2308.04412v1 null
2023-08-08 Data Augmentation-Based Unsupervised Domain Adaptation In Medical Imaging Sebastian Nørgaard Llambias et.al. 2308.04395v1 null
2023-08-08 SSTFormer: Bridging Spiking Neural Network and Memory Support Transformer for Frame-Event based Recognition Xiao Wang et.al. 2308.04369v1 link
2023-08-08 Vascular Ageing and Smoking Habit Prediction via a Low-Cost Single-Lead ECG Module S. Anas Ali et.al. 2308.04355v1 null
2023-08-08 A Lightweight and Accurate Face Detection Algorithm Based on Retinaface Baozhu Liu et.al. 2308.04340v1 null
2023-08-08 Pengembangan Model untuk Mendeteksi Kerusakan pada Terumbu Karang dengan Klasifikasi Citra Fadhil Muhammad et.al. 2308.04337v1 null
2023-08-08 Embracing Safe Contacts with Contact-aware Planning and Control Zhaoting Li et.al. 2308.04323v1 null
2023-08-07 3D Motion Magnification: Visualizing Subtle Motions with Time Varying Radiance Fields Brandon Y. Feng et.al. 2308.03757v1 null
2023-08-07 What about translation? New coding system for content analysis on the perception of literary translation around the political transformation in 1989 in Hungary as a classification problem on an unbalanced dataset Dalma Galambos et.al. 2308.03742v1 null
2023-08-07 Efficient Temporal Sentence Grounding in Videos with Multi-Teacher Knowledge Distillation Renjie Liang et.al. 2308.03725v1 null
2023-08-07 Automated Real Time Delineation of Supraclavicular Brachial Plexus in Neck Ultrasonography Videos: A Deep Learning Approach Abhay Tyagi et.al. 2308.03717v1 null
2023-08-08 Communication-Efficient Framework for Distributed Image Semantic Wireless Transmission Bingyan Xie et.al. 2308.03713v2 null
2023-08-07 Scaling may be all you need for achieving human-level object recognition capacity with human-like visual experience A. Emin Orhan et.al. 2308.03712v1 link
2023-08-07 Video-based Person Re-identification with Long Short-Term Representation Learning Xuehu Liu et.al. 2308.03703v1 null
2023-08-08 Screen-based 3D Subjective Experiment Software Songlin Fan et.al. 2308.03698v2 null
2023-08-07 Learning Concise and Descriptive Attributes for Visual Recognition An Yan et.al. 2308.03685v1 null
2023-08-07 Detecting Spells in Fantasy Literature with a Transformer Based Artificial Intelligence Marcel Moravek et.al. 2308.03660v1 null
2023-08-04 Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP Qihang Yu et.al. 2308.02487v1 link
2023-08-04 BlindSage: Label Inference Attacks against Node-level Vertical Federated Graph Neural Networks Marco Arazzi et.al. 2308.02465v1 null
2023-08-04 Nonprehensile Planar Manipulation through Reinforcement Learning with Multimodal Categorical Exploration Juan Del Aguila Ferrandis et.al. 2308.02459v1 null
2023-08-04 Getting the Ball Rolling: Learning a Dexterous Policy for a Biomimetic Tendon-Driven Hand with Rolling Contact Joints Yasunori Toshimitsu et.al. 2308.02453v1 null
2023-08-04 Adaptive Preferential Attached kNN Graph With Distribution-Awareness Shaojie Min et.al. 2308.02442v1 link
2023-08-04 Scaling Survival Analysis in Healthcare with Federated Survival Forests: A Comparative Study on Heart Failure and Breast Cancer Genomics Alberto Archetti et.al. 2308.02382v1 null
2023-08-04 Brain MRI Segmentation using Template-Based Training and Visual Perception Augmentation Fang-Cheng Yeh et.al. 2308.02363v1 null
2023-08-04 T-UNet: Triplet UNet for Change Detection in High-Resolution Remote Sensing Images Huan Zhong et.al. 2308.02356v1 link
2023-08-04 Adapting to Change: Robust Counterfactual Explanations in Dynamic Data Landscapes Bardh Prenkaj et.al. 2308.02353v1 link
2023-08-04 Generative Image Priors for MRI Reconstruction Trained from Magnitude-Only Images Guanxiong Luo et.al. 2308.02340v1 null
2023-08-03 FROD: Robust Object Detection for Free Muhammad et.al. 2308.01888v1 null
2023-08-03 Similar image retrieval using Autoencoder. I. Automatic morphology classification of galaxies Eunsuk Seo et.al. 2308.01871v1 null
2023-08-03 Tag Prediction of Competitive Programming Problems using Deep Learning Techniques Taha Lokat et.al. 2308.01863v1 null
2023-08-03 URET: Universal Robustness Evaluation Toolkit (for Evasion) Kevin Eykholt et.al. 2308.01840v1 link
2023-08-03 Distribution-Free Inference for the Regression Function of Binary Classification Ambrus Tamás et.al. 2308.01835v1 null
2023-08-03 Deep Neural Networks Fused with Textures for Image Classification Asish Bera et.al. 2308.01813v1 null
2023-08-03 Deep Learning-based Prediction of Stress and Strain Maps in Arterial Walls for Improved Cardiovascular Risk Assessment Yasin Shokrollahi1 et.al. 2308.01771v1 null
2023-08-03 Focus on Content not Noise: Improving Image Generation for Nuclei Segmentation by Suppressing Steganography in CycleGAN Jonas Utz et.al. 2308.01769v1 null
2023-08-03 A Novel Tensor Decomposition of arbitrary order based on Block Convolution with Reflective Boundary Conditions for Multi-Dimensional Data Analysis Mahdi Molavi et.al. 2308.01768v1 null
2023-08-03 NuInsSeg: A Fully Annotated Dataset for Nuclei Instance Segmentation in H&E-Stained Histological Images Amirreza Mahbod et.al. 2308.01760v1 link
2023-08-02 ELIXR: Towards a general purpose X-ray artificial intelligence system through alignment of large language models and radiology vision encoders Shawn Xu et.al. 2308.01317v1 null
2023-08-02 More Context, Less Distraction: Visual Classification by Inferring and Conditioning on Contextual Attributes Bang An et.al. 2308.01313v1 link
2023-08-02 Revisiting DETR Pre-training for Object Detection Yan Ma et.al. 2308.01300v1 null
2023-08-02 A Probabilistic Approach to Self-Supervised Learning using Cyclical Stochastic Gradient MCMC Masoumeh Javanbakhat et.al. 2308.01271v1 null
2023-08-02 Incorporating Season and Solar Specificity into Renderings made by a NeRF Architecture using Satellite Images Michael Gableman et.al. 2308.01262v1 link
2023-08-02 Quantum Imprint of the Anharmonic Oscillator Prisco Lo Chiatto et.al. 2308.01244v1 null
2023-08-03 CMUNeXt: An Efficient Medical Image Segmentation Network based on Large Kernel and Skip Fusion Fenghe Tang et.al. 2308.01239v2 link
2023-08-02 LSF-IDM: Lightweight Deep Learning Models for Automotive Intrusion Detection Model Based on Semantic Fusion Pengzhou Cheng et.al. 2308.01237v1 null
2023-08-02 JADES. The diverse population of infant Black Holes at 4<z<11: merging, tiny, poor, but mighty Roberto Maiolino et.al. 2308.01230v1 null
2023-08-02 TeachCLIP: Multi-Grained Teaching for Efficient Text-to-Video Retrieval Kaibin Tian et.al. 2308.01217v1 null
2023-08-01 Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models Cheng-Yu Hsieh et.al. 2308.00675v1 null
2023-08-01 Human-M3: A Multi-view Multi-modal Dataset for 3D Human Pose Estimation in Outdoor Scenes Bohao Fan et.al. 2308.00628v1 link
2023-08-01 NeRT: Implicit Neural Representations for General Unsupervised Turbulence Mitigation Weiyun Jiang et.al. 2308.00622v1 null
2023-08-01 Beyond One-Hot-Encoding: Injecting Semantics to Drive Image Classifiers Alan Perotti et.al. 2308.00607v1 link
2023-08-01 Relation-Aware Distribution Representation Network for Person Clustering with Multiple Modalities Kaijian Liu et.al. 2308.00588v1 null
2023-08-01 Gradient Scaling on Deep Spiking Neural Networks with Spike-Dependent Local Information Seongsik Park et.al. 2308.00558v1 null
2023-08-01 SF-IDS: An Imbalanced Semi-Supervised Learning Framework for Fine-grained Intrusion Detection Xinran Zheng et.al. 2308.00542v1 null
2023-08-01 Compressed Private Aggregation for Scalable and Robust Federated Learning over Massive Networks Natalie Lang et.al. 2308.00540v1 link
2023-08-01 Predicting Early Dropouts of an Active and Healthy Ageing App Vasileios Perifanis et.al. 2308.00539v1 null
2023-08-01 PressureTransferNet: Human Attribute Guided Dynamic Ground Pressure Profile Transfer using 3D simulated Pressure Maps Lala Shakti Swarup Ray et.al. 2308.00538v1 null
2023-07-31 A Quantized Interband Topological Index in Two-Dimensional Systems Tharindu Fernando et.al. 2307.16893v1 null
2023-07-31 Foundational Models for Fault Diagnosis of Electrical Motors Sriram Anbalagan et.al. 2307.16891v1 null
2023-07-31 Discovering Adaptable Symbolic Algorithms from Scratch Stephen Kelly et.al. 2307.16890v1 null
2023-07-31 Universal Adversarial Defense in Remote Sensing Based on Pre-trained Denoising Diffusion Models Weikang Yu et.al. 2307.16865v1 null
2023-07-31 Nonlinearity-induced topological phase transition characterized by the nonlinear Chern number Kazuki Sone et.al. 2307.16827v1 null
2023-07-31 Defense of Adversarial Ranking Attack in Text Retrieval: Benchmark and Baseline via Detection Xuanang Chen et.al. 2307.16816v1 null
2023-07-31 Capturing Co-existing Distortions in User-Generated Content for No-reference Video Quality Assessment Kun Yuan et.al. 2307.16813v1 null
2023-07-31 DoDo Learning: DOmain-DemOgraphic Transfer in Language Models for Detecting Abuse Targeted at Public Figures Hannah Rose Kirk et.al. 2307.16811v1 null
2023-07-31 DPMix: Mixture of Depth and Point Cloud Video Experts for 4D Action Segmentation Yue Zhang et.al. 2307.16803v1 null
2023-07-31 Classification with Deep Neural Networks and Logistic Loss Zihan Zhang et.al. 2307.16792v1 null
2023-07-28 Quantum-noise-limited optical neural networks operating at a few quanta per activation Shi-Yuan Ma et.al. 2307.15712v1 null
2023-07-31 MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking Ruopeng Gao et.al. 2307.15700v2 null
2023-07-28 PatchMixer: Rethinking network design to boost generalization for 3D point cloud understanding Davide Boscaini et.al. 2307.15692v1 null
2023-07-28 ODTlearn: A Package for Learning Optimal Decision Trees for Prediction and Prescription Patrick Vossler et.al. 2307.15691v1 link
2023-07-28 Dynamic Analysis and an Eigen Initializer for Recurrent Neural Networks Ran Dou et.al. 2307.15679v1 null
2023-07-28 Bayesian Time-Series Classifier for Decoding Simple Visual Stimuli from Intracranial Neural Activity Navid Ziaei et.al. 2307.15672v1 null
2023-07-28 Classifying core collapse supernova remnants by their morphology as shaped by the last exploding jets Noam Soker et.al. 2307.15666v1 null
2023-07-28 Multi-layer Aggregation as a key to feature-based OOD detection Benjamin Lambert et.al. 2307.15647v1 null
2023-07-28 Scale-aware Test-time Click Adaptation for Pulmonary Nodule and Mass Segmentation Zhihao Li et.al. 2307.15645v1 link
2023-07-28 TriadNet: Sampling-free predictive intervals for lesional volume in 3D brain MR images Benjamin Lambert et.al. 2307.15638v1 null
2023-07-27 PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking Yang Zheng et.al. 2307.15055v1 null
2023-07-27 A Transformer-based Approach for Arabic Offline Handwritten Text Recognition Saleh Momeni et.al. 2307.15045v1 null
2023-07-27 Drive Asymmetry, Convergence and the Origin of Turbulence in ICF Implosions Vincent A. Thomas et.al. 2307.15028v1 null
2023-07-27 Self-Supervised Graph Transformer for Deepfake Detection Aminollah Khormali et.al. 2307.15019v1 null
2023-07-27 The last patch for classifying shuffle groups Junyang Zhang et.al. 2307.15012v1 null
2023-07-27 Gzip versus bag-of-words for text classification with KNN Juri Opitz et.al. 2307.15002v1 null
2023-07-27 Incrementally-Computable Neural Networks: Efficient Inference for Dynamic Inputs Or Sharir et.al. 2307.14988v1 null
2023-07-27 Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models Ziyi Wang et.al. 2307.14971v1 link
2023-07-27 Federated Model Aggregation via Self-Supervised Priors for Highly Imbalanced Medical Image Classification Marawan Elbatel et.al. 2307.14959v1 link
2023-07-27 Multi-Source Domain Adaptation through Dataset Dictionary Learning in Wasserstein Space Eduardo Fernandes Montesuma et.al. 2307.14953v1 null
2023-07-26 MAMo: Leveraging Memory and Attention for Monocular Video Depth Estimation Rajeev Yasarla et.al. 2307.14336v1 null
2023-07-26 Event-based Vision for Early Prediction of Manipulation Actions Daniel Deniz et.al. 2307.14332v1 null
2023-07-26 Waypoint-Based Imitation Learning for Robotic Manipulation Lucy Xiaoyang Shi et.al. 2307.14326v1 null
2023-07-26 Unraveling the Complexity of Splitting Sequential Data: Tackling Challenges in Video and Time Series Analysis Diego Botache et.al. 2307.14294v1 null
2023-07-26 G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory Hongxiang Li et.al. 2307.14277v1 null
2023-07-26 Deepfake Image Generation for Improved Brain Tumor Segmentation Roa'a Al-Emaryeen et.al. 2307.14273v1 null
2023-07-26 Sim-to-Real Model-Based and Model-Free Deep Reinforcement Learning for Tactile Pushing Max Yang et.al. 2307.14272v1 null
2023-07-26 Artifact Restoration in Histology Images with Diffusion Probabilistic Models Zhenqi He et.al. 2307.14262v1 link
2023-07-26 Defending Adversarial Patches via Joint Region Localizing and Inpainting Junwen Chen et.al. 2307.14242v1 null
2023-07-26 DisguisOR: Holistic Face Anonymization for the Operating Room Lennart Bastian et.al. 2307.14241v1 link
2023-07-25 RED CoMETS: An ensemble classifier for symbolically represented multivariate time series Luca A. Bennett et.al. 2307.13679v1 link
2023-07-25 QuickQual: Lightweight, convenient retinal image quality scoring with off-the-shelf pretrained models Justin Engelmann et.al. 2307.13646v1 link
2023-07-25 Manifestly Covariant Worldline Actions from Coadjoint Orbits. Part I: Generalities and Vectorial Descriptions Thomas Basile et.al. 2307.13644v1 null
2023-07-25 Optical Flow boosts Unsupervised Localization and Segmentation Xinyu Zhang et.al. 2307.13640v1 link
2023-07-25 Insights into Cognitive Engagement: Comparing the Effectiveness of Game-Based and Video-Based Learning Shayla Sharmin et.al. 2307.13637v1 null
2023-07-25 Contributions to the Improvement of Question Answering Systems in the Biomedical Domain Mourad Sarrouti et.al. 2307.13631v1 null
2023-07-25 Chandra X-ray Observatory Observations of 13 Fermi LAT Sources Blagoy Rangelov et.al. 2307.13594v1 null
2023-07-25 Reinterpreting survival analysis in the universal approximator age Sören Dittmer et.al. 2307.13579v1 link
2023-07-25 PT$\mathrm{L}^{p}$: Partial Transport $\mathrm{L}^{p}$ Distances Xinran Liu et.al. 2307.13571v1 null
2023-07-25 Group Activity Recognition in Computer Vision: A Comprehensive Review, Challenges, and Future Perspectives Chuanchuan Wang et.al. 2307.13541v1 null
2023-07-24 Leveraging Label Variation in Large Language Models for Zero-Shot Text Classification Flor Miriam Plaza-del-Arco et.al. 2307.12973v1 null
2023-07-24 A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning Benjamin Eysenbach et.al. 2307.12968v1 link
2023-07-24 Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment Sarah Ibrahimi et.al. 2307.12964v1 null
2023-07-24 Rule By Example: Harnessing Logical Rules for Explainable Hate Speech Detection Christopher Clarke et.al. 2307.12935v1 link
2023-07-25 Towards a Visual-Language Foundation Model for Computational Pathology Ming Y. Lu et.al. 2307.12914v2 null
2023-07-24 Dyn-E: Local Appearance Editing of Dynamic Neural Radiance Fields Shangzhan Zhang et.al. 2307.12909v1 null
2023-07-24 Conditional Residual Coding: A Remedy for Bottleneck Problems in Conditional Inter Frame Coding Fabian Brand et.al. 2307.12864v1 null
2023-07-24 Multiscale Video Pretraining for Long-Term Activity Forecasting Reuben Tan et.al. 2307.12854v1 null
2023-07-25 Spatiotemporal Modeling Encounters 3D Medical Image Analysis: Slice-Shift UNet with Multi-View Fusion C. I. Ugwu et.al. 2307.12853v2 null
2023-07-24 Early Neuron Alignment in Two-layer ReLU Networks with Small Initialization Hancheng Min et.al. 2307.12851v1 null
2023-07-21 Advanced Monte Carlo simulation techniques to study polymers under equilibrium conditions Monika Angwani et.al. 2307.11722v1 null
2023-07-21 Deep Learning Hyperspectral Pansharpening on large scale PRISMA dataset Simone Zini et.al. 2307.11666v1 null
2023-07-21 FEDD -- Fair, Efficient, and Diverse Diffusion-based Lesion Segmentation and Malignancy Classification Héctor Carrión et.al. 2307.11654v1 null
2023-07-21 Sparse Cholesky factorization by greedy conditional selection Stephen Huan et.al. 2307.11648v1 link
2023-07-24 Morphological Image Analysis and Feature Extraction for Reasoning with AI-based Defect Detection and Classification Models Jiajun Zhang et.al. 2307.11643v2 null
2023-07-21 Deep Reinforcement Learning Based System for Intraoperative Hyperspectral Video Autofocusing Charlie Budd et.al. 2307.11638v1 null
2023-07-21 Computational Image Formation Stanley H. Chan et.al. 2307.11635v1 null
2023-07-21 Finding Optimal Diverse Feature Sets with Alternative Feature Selection Jakob Bach et.al. 2307.11607v1 null
2023-07-21 Cascaded multitask U-Net using topological loss for vessel segmentation and centerline extraction Pierre Rougé et.al. 2307.11603v1 null
2023-07-21 Mixbiotic society measures: Assessment of community well-going as living system Takeshi Kato et.al. 2307.11594v1 null
2023-07-20 GLSFormer: Gated - Long, Short Sequence Transformer for Step Recognition in Surgical Videos Nisarg A. Shah et.al. 2307.11081v1 link
2023-07-20 Driving Policy Prediction based on Deep Learning Models Fuxiao Liu et.al. 2307.11058v1 null
2023-07-20 Cascade-DETR: Delving into High-Quality Universal Object Detection Mingqiao Ye et.al. 2307.11035v1 link
2023-07-20 Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification Neel Guha et.al. 2307.11031v1 null
2023-07-20 Cluster-aware Semi-supervised Learning: Relational Knowledge Distillation Provably Learns Clustering Yijun Dong et.al. 2307.11030v1 null
2023-07-20 Multi-objective point cloud autoencoders for explainable myocardial infarction prediction Marcel Beetz et.al. 2307.11017v1 null
2023-07-20 Treatment And Follow-Up Guidelines For Multiple Brain Metastases: A Systematic Review Ana Sofia Santos et.al. 2307.11016v1 null
2023-07-21 Dense Sample Deep Learning Stephen Josè Hanson et.al. 2307.10991v2 null
2023-07-20 Deep Spiking-UNet for Image Processing Hebei Li et.al. 2307.10974v1 link
2023-07-20 Spinal nerve segmentation method and dataset construction in endoscopic surgical scenarios Shaowu Peng et.al. 2307.10955v1 link
2023-07-19 DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering Wei Cheng et.al. 2307.10173v1 link
2023-07-19 Adversarial Latent Autoencoder with Self-Attention for Structural Image Synthesis Jiajie Fan et.al. 2307.10166v1 null
2023-07-19 Leveraging Visemes for Better Visual Speech Representation and Lip Reading Javad Peymanfard et.al. 2307.10157v1 null
2023-07-19 Remarks on a theorem of Pink in presence of bad reduction Wojciech Gajda et.al. 2307.10140v1 null
2023-07-19 Gradient Sparsification For Masked Fine-Tuning of Transformers James O' Neill et.al. 2307.10098v1 null
2023-07-19 Boundary-Refined Prototype Generation: A General End-to-End Paradigm for Semi-Supervised Semantic Segmentation Junhao Dong et.al. 2307.10097v1 null
2023-07-19 Make-A-Volume: Leveraging Latent Diffusion Models for Cross-Modality 3D Brain MRI Synthesis Lingting Zhu et.al. 2307.10094v1 null
2023-07-19 Divert More Attention to Vision-Language Object Tracking Mingzhe Guo et.al. 2307.10046v1 link
2023-07-19 A non-monotone extra-gradient trust-region method with noisy oracles Natasa Krejic et.al. 2307.10038v1 null
2023-07-20 Class Attention to Regions of Lesion for Imbalanced Medical Image Recognition Jia-Xin Zhuang et.al. 2307.10036v2 null
2023-07-18 AnyDoor: Zero-shot Object-level Image Customization Xi Chen et.al. 2307.09481v1 null
2023-07-18 FACTS: Facial Animation Creation using the Transfer of Styles Jack Saunders et.al. 2307.09480v1 null
2023-07-18 GroupLane: End-to-End 3D Lane Detection with Channel-wise Grouping Zhuoling Li et.al. 2307.09472v1 null
2023-07-18 Smooth Attention for Deep Multiple Instance Learning: Application to CT Intracranial Hemorrhage Detection Yunan Wu et.al. 2307.09457v1 link
2023-07-19 A comparative analysis of SRGAN models Fatemeh Rezapoor Nikroo et.al. 2307.09456v2 null
2023-07-19 Pseudo Outlier Exposure for Out-of-Distribution Detection using Pretrained Transformers Jaeyoung Kim et.al. 2307.09455v2 null
2023-07-18 Measuring Student Behavioral Engagement using Histogram of Actions Ahmed Abdelkawy et.al. 2307.09420v1 null
2023-07-18 Is this Snippet Written by ChatGPT? An Empirical Study with a CodeBERT-Based Classifier Phuong T. Nguyen et.al. 2307.09381v1 null
2023-07-18 CertPri: Certifiable Prioritization for Deep Neural Networks via Movement Cost in Feature Space Haibin Zheng et.al. 2307.09375v1 null
2023-07-18 Enhancing Pattern Classification in Support Vector Machines through Matrix Formulation Sambhav Jain Reshma Rastogi et.al. 2307.09372v1 null
2023-07-17 Diffusion Models Beat GANs on Image Classification Soumik Mukhopadhyay et.al. 2307.08702v1 null
2023-07-17 Neural Video Depth Stabilizer Yiran Wang et.al. 2307.08695v1 link
2023-07-17 SEMI-DiffusionInst: A Diffusion Model Based Approach for Semiconductor Defect Classification and Segmentation Vic De Ridder et.al. 2307.08693v1 null
2023-07-17 FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning Tri Dao et.al. 2307.08691v1 link
2023-07-17 Implementation of a perception system for autonomous vehicles using a detection-segmentation network in SoC FPGA Maciej Baczmanski et.al. 2307.08682v1 null
2023-07-17 Neural Image Compression: Generalization, Robustness, and Spectral Biases Kelsey Lieberman et.al. 2307.08657v1 null
2023-07-17 PolyGNN: Polyhedron-based Graph Neural Network for 3D Building Reconstruction from Point Clouds Zhaiyu Chen et.al. 2307.08636v1 null
2023-07-17 Deficiency-Aware Masked Transformer for Video Inpainting Yongsheng Yu et.al. 2307.08629v1 link
2023-07-17 BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs Yang Zhao et.al. 2307.08581v1 null
2023-07-18 Deep Learning with Passive Optical Nonlinear Mapping Fei Xia et.al. 2307.08558v2 null
2023-07-14 Expressive Monotonic Neural Networks Ouail Kitouni et.al. 2307.07512v1 link
2023-07-14 Streaming CTR Prediction: Rethinking Recommendation Task for Real-World Streaming Data Qi-Wei Wang et.al. 2307.07509v1 null
2023-07-14 Brain Tumor Detection using Convolutional Neural Networks with Skip Connections Aupam Hamran et.al. 2307.07503v1 null
2023-07-14 TALL: Thumbnail Layout for Deepfake Video Detection Yuting Xu et.al. 2307.07494v1 null
2023-07-14 DreamTeacher: Pretraining Image Backbones with Deep Generative Models Daiqing Li et.al. 2307.07487v1 null
2023-07-14 Multimodal Distillation for Egocentric Action Recognition Gorjan Radevski et.al. 2307.07483v1 null
2023-07-14 Dual-Query Multiple Instance Learning for Dynamic Meta-Embedding based Tumor Classification Simon Holdenried-Krafft et.al. 2307.07482v1 null
2023-07-14 Passage-times for partially-homogeneous reflected random walks on the quadrant Conrado da Costa et.al. 2307.07458v1 null
2023-07-14 An equivariant surgery classification of $C_p$-surfaces Kelly Pohland et.al. 2307.07446v1 null
2023-07-14 Can Large Language Models Empower Molecular Property Prediction? Chen Qian et.al. 2307.07443v1 link
2023-07-13 Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition Syed Talal Wasim et.al. 2307.06947v1 link
2023-07-13 InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation Yi Wang et.al. 2307.06942v1 link
2023-07-13 Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation Yingqing He et.al. 2307.06940v1 link
2023-07-13 DRAGON: A Dialogue-Based Robot for Assistive Navigation with Visual Language Grounding Shuijing Liu et.al. 2307.06924v1 null
2023-07-13 Provable Multi-Task Representation Learning by Two-Layer ReLU Neural Networks Liam Collins et.al. 2307.06887v1 null
2023-07-13 LVLane: Deep Learning for Lane Detection and Classification in Challenging Conditions Zillur Rahman et.al. 2307.06853v1 link
2023-07-13 Leveraging Vision-Language Foundation Models for Fine-Grained Downstream Tasks Denis Coquenet et.al. 2307.06795v1 link
2023-07-13 Robotic surface exploration with vision and tactile sensing for cracks detection and characterisation Francesca Palermo et.al. 2307.06784v1 null
2023-07-13 Generalizing Supervised Deep Learning MRI Reconstruction to Multiple and Unseen Contrasts using Meta-Learning Hypernetworks Sriprabha Ramanarayanan et.al. 2307.06771v1 link
2023-07-13 Pairs of inner projections and two applications Ramlal Debnath et.al. 2307.06744v1 null
2023-07-12 Deep Learning of Crystalline Defects from TEM images: A Solution for the Problem of "Never Enough Training Data" Kishan Govind et.al. 2307.06322v1 null
2023-07-12 A geometric classification of rod complements in the 3-torus Connie On Yu Hui et.al. 2307.06317v1 null
2023-07-12 Facial Reenactment Through a Personalized Generator Ariel Elazary et.al. 2307.06307v1 null
2023-07-12 Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution Mostafa Dehghani et.al. 2307.06304v1 null
2023-07-12 Feature Embeddings from Large-Scale Acoustic Bird Classifiers Enable Few-Shot Transfer Learning Burooj Ghani et.al. 2307.06292v1 null
2023-07-12 Stochastic Light Field Holography Florian Schiffers et.al. 2307.06277v1 null
2023-07-12 Machine learning and Topological data analysis identify unique features of human papillae in 3D scans Rayna Andreeva et.al. 2307.06255v1 null
2023-07-12 On the Importance of Denoising when Learning to Compress Images Benoit Brummer et.al. 2307.06233v1 link
2023-07-12 Ashaar: Automatic Analysis and Generation of Arabic Poetry Using Deep Learning Approaches Zaid Alyafeai et.al. 2307.06218v1 link
2023-07-12 Local Conditional Neural Fields for Versatile and Generalizable Large-Scale Reconstructions in Computational Imaging Hao Wang et.al. 2307.06207v1 null
2023-07-11 Fractonic Higher-Order Topological Phases in Open Quantum Systems Jian-Hao Zhang et.al. 2307.05474v1 null
2023-07-11 Differentiable Blocks World: Qualitative 3D Decomposition by Rendering Primitives Tom Monnier et.al. 2307.05473v1 null
2023-07-11 EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone Shraman Pramanick et.al. 2307.05463v1 null
2023-07-11 Improving the Security of Smartwatch Payment with Deep Learning George Webber et.al. 2307.05437v1 null
2023-07-11 One-Versus-Others Attention: Scalable Multimodal Integration Michal Golovanevsky et.al. 2307.05435v1 link
2023-07-11 Identifying Acoustic Wave Sources on the Sun. II. Improved Filter Techniques for Source Wavefield Seismology Shah Mohammad Bahauddin et.al. 2307.05433v1 null
2023-07-11 Effective Whitney Stratification of Real Algebraic Varieties Martin Helmer et.al. 2307.05427v1 null
2023-07-11 Domain-Agnostic Neural Architecture for Class Incremental Continual Learning in Document Processing Platform Mateusz Wójcik et.al. 2307.05399v1 link
2023-07-11 ShredGP: Guitarist Style-Conditioned Tablature Generation Pedro Sarmento et.al. 2307.05324v1 null
2023-07-11 Class Instance Balanced Learning for Long-Tailed Classification Marc-Antoine Lavoie et.al. 2307.05322v1 null
2023-07-10 Semantic-SAM: Segment and Recognize Anything at Any Granularity Feng Li et.al. 2307.04767v1 link
2023-07-10 Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos Sagnik Majumder et.al. 2307.04760v1 null
2023-07-10 Shelving, Stacking, Hanging: Relational Pose Diffusion for Multi-modal Rearrangement Anthony Simeonov et.al. 2307.04751v1 null
2023-07-10 RoCo: Dialectic Multi-Robot Collaboration with Large Language Models Zhao Mandi et.al. 2307.04738v1 link
2023-07-10 AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning Yuwei Guo et.al. 2307.04725v1 null
2023-07-10 Quark/Gluon Discrimination and Top Tagging with Dual Attention Transformer Minxuan He et.al. 2307.04723v1 null
2023-07-10 CVPR MultiEarth 2023 Deforestation Estimation Challenge:SpaceVision4Amazon Sunita Arya et.al. 2307.04715v1 null
2023-07-10 Multimodal brain age estimation using interpretable adaptive population-graph learning Kyriaki-Margarita Bintsi et.al. 2307.04639v1 null
2023-07-10 Learning Fine Pinch-Grasp Skills using Tactile Sensing from Real Demonstration Data Xiaofeng Mao et.al. 2307.04619v1 null
2023-07-10 Weakly-supervised positional contrastive learning: application to cirrhosis classification Emma Sarfati et.al. 2307.04617v1 null
2023-07-07 On the representation theory of cyclic and dihedral quandles Mohamed Elhamdadi et.al. 2307.03728v1 null
2023-07-07 Polybot: Training One Policy Across Robots While Embracing Variability Jonathan Yang et.al. 2307.03719v1 null
2023-07-07 Motion Magnification in Robotic Sonography: Enabling Pulsation-Aware Artery Segmentation Dianye Huang et.al. 2307.03698v1 null
2023-07-07 Detecting the Sensing Area of A Laparoscopic Probe in Minimally Invasive Cancer Surgery Baoru Huang et.al. 2307.03662v1 null
2023-07-07 Physical-aware Cross-modal Adversarial Network for Wearable Sensor-based Human Action Recognition Jianyuan Ni et.al. 2307.03638v1 null
2023-07-07 VesselVAE: Recursive Variational Autoencoders for 3D Blood Vessel Synthesis Paula Feldman et.al. 2307.03592v1 null
2023-07-07 SpawnNet: Learning Generalizable Visuomotor Skills from Pre-trained Networks Xingyu Lin et.al. 2307.03567v1 null
2023-07-07 VariGrad: A Novel Feature Vector Architecture for Geometric Deep Learning on Unregistered Data Emmanuel Hartman et.al. 2307.03553v1 null
2023-07-07 TBGC: Task-level Backbone-Oriented Gradient Clip for Multi-Task Foundation Model Learning Zelun Zhang et.al. 2307.03465v1 null
2023-07-07 A Deep Active Contour Model for Delineating Glacier Calving Fronts Konrad Heidler et.al. 2307.03461v1 null
2023-07-06 Synthesizing Artistic Cinemagraphs from Text Aniruddha Mahapatra et.al. 2307.03190v1 null
2023-07-06 Long-term follow-up observations of extreme coronal line emitting galaxies Peter Clark et.al. 2307.03182v1 null
2023-07-06 Push Past Green: Learning to Look Behind Plant Foliage by Moving It Xiaoyu Zhang et.al. 2307.03175v1 null
2023-07-06 VideoGLUE: Video General Understanding Evaluation of Foundation Models Liangzhe Yuan et.al. 2307.03166v1 null
2023-07-06 Can Domain Adaptation Improve Accuracy and Fairness of Skin Lesion Classification? Janet Wang et.al. 2307.03157v1 null
2023-07-06 MultiVENT: Multilingual Videos of Events with Aligned Natural Text Kate Sanders et.al. 2307.03153v1 null
2023-07-06 Topology-Aware Loss for Aorta and Great Vessel Segmentation in Computed Tomography Images Seher Ozcelik et.al. 2307.03137v1 null
2023-07-06 Distilling Large Vision-Language Model with Out-of-Distribution Generalizability Xuanlin Li et.al. 2307.03135v1 link
2023-07-06 Benchmarking Test-Time Adaptation against Distribution Shifts in Image Classification Yongcan Yu et.al. 2307.03133v1 link
2023-07-06 VisKoP: Visual Knowledge oriented Programming for Interactive Knowledge Base Question Answering Zijun Yao et.al. 2307.03130v1 null
2023-07-05 Building Cooperative Embodied Agents Modularly with Large Language Models Hongxin Zhang et.al. 2307.02485v1 null
2023-07-05 Elastic Decision Transformer Yueh-Hua Wu et.al. 2307.02484v1 null
2023-07-05 What Matters in Training a GPT4-Style Language Model with Multimodal Inputs? Yan Zeng et.al. 2307.02469v1 null
2023-07-05 Supersymmetric asymptotically locally AdS$_5$ gravitational solitons Turkuler Durgut et.al. 2307.02466v1 null
2023-07-05 AxonCallosumEM Dataset: Axon Semantic Segmentation of Whole Corpus Callosum cross section from EM Images Ao Cheng et.al. 2307.02464v1 null
2023-07-05 Expert-Agnostic Ultrasound Image Quality Assessment using Deep Variational Clustering Deepak Raina et.al. 2307.02462v1 null
2023-07-05 LLCaps: Learning to Illuminate Low-Light Capsule Endoscopy with Curved Wavelet Attention and Reverse Diffusion Long Bai et.al. 2307.02452v1 link
2023-07-05 On Deep Learning Classification of Digitally Modulated Signals Using Raw I/Q Data John A. Snoap et.al. 2307.02450v1 null
2023-07-05 Vulnerable Source Code Detection using SonarCloud Code Analysis Alifia Puspaningrum et.al. 2307.02446v1 null
2023-07-05 Base Layer Efficiency in Scalable Human-Machine Coding Yalda Foroutan et.al. 2307.02430v1 null
2023-07-03 Real-time Monocular Full-body Capture in World Space via Sequential Proxy-to-Motion Learning Yuxiang Zhang et.al. 2307.01200v1 null
2023-07-03 Segment Anything Meets Point Tracking Frano Rajič et.al. 2307.01197v1 link
2023-07-03 Online nearest neighbor classification Sanjoy Dasgupta et.al. 2307.01170v1 null
2023-07-03 Don't freeze: Finetune encoders for better Self-Supervised HAR Vitor Fortes Rey et.al. 2307.01168v1 null
2023-07-03 Characteristic signatures of accreting binary black holes produced by eccentric minidisks John Ryan Westernacher-Schneider et.al. 2307.01154v1 null
2023-07-03 Integral cohomology rings of weighted Grassmann orbifolds and Rigidity properties Koushik Brahma et.al. 2307.01153v1 null
2023-07-03 Investigating Data Memorization in 3D Latent Diffusion Models for Medical Image Synthesis Salman Ul Hassan Dar et.al. 2307.01148v1 null
2023-07-05 AVSegFormer: Audio-Visual Segmentation with Transformer Shengyi Gao et.al. 2307.01146v2 link
2023-07-03 Cross-modality Attention Adapter: A Glioma Segmentation Fine-tuning Method for SAM Using Multimodal Brain MR Images Xiaoyu Shi et.al. 2307.01124v1 null
2023-07-03 Supervised Manifold Learning via Random Forest Geometry-Preserving Proximities Jake S. Rhodes et.al. 2307.01077v1 null
2023-07-03 SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs Lijun Yu et.al. 2306.17842v2 null
2023-06-30 Learning Evacuee Models from Robot-Guided Emergency Evacuation Experiments Mollik Nayyar et.al. 2306.17824v1 null
2023-06-30 Act3D: Infinite Resolution Action Detection Transformer for Robotic Manipulation Theophile Gervet et.al. 2306.17817v1 null
2023-06-30 Topologically Attributed Graphs for Shape Discrimination Justin Curry et.al. 2306.17805v1 null
2023-06-30 Vision Through the Veil: Differential Privacy in Federated Learning for Medical Image Classification Kishore Babu Nampalle et.al. 2306.17794v1 null
2023-06-30 Precision Anti-Cancer Drug Selection via Neural Ranking Vishal Dey et.al. 2306.17771v1 null
2023-06-30 Improved NL2SQL based on Multi-layer Expert Network Chenduo Hao et.al. 2306.17727v1 null
2023-06-30 Content-Preserving Diffusion Model for Unsupervised AS-OCT image Despeckling Li Sanqian et.al. 2306.17717v1 null
2023-06-30 Evaluation of the Benefits of Zero Velocity Update in Decentralized EKF-Based Cooperative Localization Algorithms for GNSS-Denied Multi-Robot Systems Cagri Kilic et.al. 2306.17703v1 null
2023-06-30 Generalized Time Warping Invariant Dictionary Learning for Time Series Classification and Clustering Ruiyu Xu et.al. 2306.17690v1 null
2023-06-29 An Efficient General-Purpose Modular Vision Model via Multi-Task Heterogeneous Training Zitian Chen et.al. 2306.17165v1 null
2023-06-29 Can Machines Garden? Systematically Comparing the AlphaGarden vs. Professional Horticulturalists Simeon Adebola et.al. 2306.17162v1 null
2023-06-29 FogROS2-SGC: A ROS2 Cloud Robotics Platform for Secure Global Connectivity Kaiyuan Chen et.al. 2306.17157v1 null
2023-06-29 Orbit Classification of asteroids using implementation of radial Basis Function on Support Vector Machines Yashvir Tiberwal et.al. 2306.17138v1 null
2023-06-29 On separably integrable symmetric convex bodies Vladyslav Yaskin et.al. 2306.17127v1 null
2023-06-29 PVP: Personalized Video Prior for Editable Dynamic Portraits using StyleGAN Kai-En Lin et.al. 2306.17123v1 null
2023-06-29 Learning Nuclei Representations with Masked Image Modelling Piotr Wójcik et.al. 2306.17116v1 null
2023-06-29 Deep Ensemble for Rotorcraft Attitude Prediction Hikmat Khan et.al. 2306.17104v1 null
2023-06-29 Twice Binnable Color Filter Arrays Mritunjay Singh et.al. 2306.17078v1 null
2023-06-29 Extremal behavior of reduced type of one dimensional rings Sarasij Maitra et.al. 2306.17069v1 null
2023-06-28 Class Numbers, Congruent Numbers and Umbral Moonshine Miranda C. N. Cheng et.al. 2306.16414v1 null
2023-06-28 Information-Computation Tradeoffs for Learning Margin Halfspaces with Random Classification Noise Ilias Diakonikolas et.al. 2306.16352v1 null
2023-06-28 Accurate, uncertainty-aware classification of molecular chemical motifs from multi-modal X-ray absorption spectroscopy Matthew R. Carbone et.al. 2306.16349v1 null
2023-06-28 DoseDiff: Distance-aware Diffusion Model for Dose Prediction in Radiotherapy Yiwen Zhang et.al. 2306.16324v1 null
2023-06-28 Universal theory of spin-momentum-orbital-site locking Yuntian Liu et.al. 2306.16312v1 null
2023-06-28 Generalizing Surgical Instruments Segmentation to Unseen Domains with One-to-Many Synthesis An Wang et.al. 2306.16285v1 link
2023-06-28 Emotion Analysis of Tweets Banning Education in Afghanistan Mohammad Ali Hussiny et.al. 2306.16268v1 null
2023-06-28 Reconfigurable Robot Control Using Flexible Coupling Mechanisms Sha Yi et.al. 2306.16265v1 null
2023-06-28 Latent SDEs on Homogeneous Spaces Sebastian Zeng et.al. 2306.16248v1 null
2023-06-28 Investigating the Uncanny Valley Phenomenon Through the Temporal Dynamics of Neural Responses to Virtual Characters Chiara Gorlini et.al. 2306.16233v1 null
2023-06-27 Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical Properties Hsiao-Yu Tung et.al. 2306.15668v1 null
2023-06-27 Enhancing Representation Learning on High-Dimensional, Small-Size Tabular Data: A Divide and Conquer Method with Ensembled VAEs Navindu Leelarathna et.al. 2306.15661v1 null
2023-06-27 Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos Chiori Hori et.al. 2306.15644v1 null
2023-06-27 Biclustering random matrix partitions with an application to classification of forensic body fluids Chieh-Hsi Wu et.al. 2306.15622v1 null
2023-06-27 Recurrent Neural Network-coupled SPAD TCSPC System for Real-time Fluorescence Lifetime Imaging Yang Lin et.al. 2306.15599v1 null
2023-06-27 Optimizing Credit Limit Adjustments Under Adversarial Goals Using Reinforcement Learning Sherly Alfonso-Sánchez et.al. 2306.15585v1 null
2023-06-27 Parity doublet model for baryon octets: diquark classifications and mass hierarchy based on the quark-line diagram Takuya Minamikawa et.al. 2306.15564v1 null
2023-06-27 You Can Mask More For Extremely Low-Bitrate Image Compression Anqi Li et.al. 2306.15561v1 link
2023-06-27 A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms Cristina Silvano et.al. 2306.15552v1 null
2023-06-27 Self-supervised Learning of Event-guided Video Frame Interpolation for Rolling Shutter Frames Yunfan Lu et.al. 2306.15507v1 null
2023-06-26 FunQA: Towards Surprising Video Comprehension Binzhu Xie et.al. 2306.14899v1 link
2023-06-26 Mapping out phase diagrams with generative classifiers Julian Arnold et.al. 2306.14894v1 null
2023-06-26 Fuzzy-Conditioned Diffusion and Diffusion Projection Attention Applied to Facial Image Correction Majed El Helou et.al. 2306.14891v1 link
2023-06-26 A Fully Unsupervised Instance Segmentation Technique for White Blood Cell Images Shrijeet Biswas et.al. 2306.14875v1 null
2023-06-26 ANYmal Parkour: Learning Agile Navigation for Quadrupedal Robots David Hoeller et.al. 2306.14874v1 null
2023-06-26 Leveraging Task Structures for Improved Identifiability in Neural Network Representations Wenlin Chen et.al. 2306.14861v1 null
2023-06-26 ViNT: A Foundation Model for Visual Navigation Dhruv Shah et.al. 2306.14846v1 null
2023-06-26 An open-source robust machine learning platform for real-time detection and classification of 2D material flakes Jan-Lucas Uslu et.al. 2306.14845v1 null
2023-06-26 A Flyweight CNN with Adaptive Decoder for Schistosoma mansoni Egg Detection Leonardo de Melo Joao et.al. 2306.14840v1 null
2023-06-26 Label-Aware Hyperbolic Embeddings for Fine-grained Emotion Classification Chih-Yao Chen et.al. 2306.14822v1 link
2023-06-23 Adversarial Robustness Certification for Bayesian Neural Networks Matthew Wicker et.al. 2306.13614v1 link
2023-06-23 TACOformer:Token-channel compounded Cross Attention for Multimodal Emotion Recognition Xinda Li et.al. 2306.13592v1 null
2023-06-23 Estimating Residential Solar Potential Using Aerial Data Ross Goroshin et.al. 2306.13564v1 null
2023-06-23 Efficient Model Selection for Predictive Pattern Mining Model by Safe Pattern Pruning Takumi Yoshida et.al. 2306.13561v1 null
2023-06-26 FPGA Implementation of Convolutional Neural Network for Real-Time Handwriting Recognition Shichen Qiao et.al. 2306.13557v2 link
2023-06-23 Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation Massimiliano Patacchiola et.al. 2306.13554v1 link
2023-06-23 Manifold Contrastive Learning with Variational Lie Group Operators Kion Fallah et.al. 2306.13544v1 null
2023-06-23 Torsion Graph Neural Networks Cong Shen et.al. 2306.13541v1 link
2023-06-23 Topological learning for the classification of disorder: an application to the design of metasurfaces Tristan Madeleine et.al. 2306.13540v1 null
2023-06-23 WBCAtt: A White Blood Cell Dataset Annotated with Detailed Morphological Attributes Satoshi Tsutsui et.al. 2306.13531v1 link
2023-06-22 A Comparison of Time-based Models for Multimodal Emotion Recognition Ege Kesim et.al. 2306.13076v1 null
2023-06-22 Auditing Predictive Models for Intersectional Biases Kate S. Boxer et.al. 2306.13064v1 null
2023-06-22 Impacts and Risk of Generative AI Technology on Cyber Defense Subash Neupane et.al. 2306.13033v1 null
2023-06-22 Toward Automated Detection of Microbleeds with Anatomical Scale Localization: A Complete Clinical Diagnosis Support Using Deep Learning Jun-Ho Kim et.al. 2306.13020v1 null
2023-06-22 Minimalist and High-Quality Panoramic Imaging with PSF-aware Transformers Qi Jiang et.al. 2306.12992v1 link
2023-06-22 Can a single image processing algorithm work equally well across all phases of DCE-MRI? Adam G. Tattersall et.al. 2306.12988v1 null
2023-06-22 Radiation Emission during the Erasure of Magnetic Monopoles Maximilian Bachmaier et.al. 2306.12958v1 null
2023-06-22 Robust Semantic Segmentation: Strong Adversarial Attacks and Fast Training of Robust Models Francesco Croce et.al. 2306.12941v1 link
2023-06-22 Deficit of Hot Dust in Low-redshift Active Galactic Nuclei Suyeon Son et.al. 2306.12927v1 null
2023-06-22 Machine-Learning-Assisted and Real-Time-Feedback-Controlled Growth of InAs/GaAs Quantum Dots Chao Shen et.al. 2306.12898v1 null
2023-06-21 Spectroscopy of the Supernova H0pe Host Galaxy at Redshift 1.78 M. Polletta et.al. 2306.12385v1 null
2023-06-21 Geometric Algorithms for $k$-NN Poisoning Diego Ihara Centurion et.al. 2306.12377v1 null
2023-06-21 M-VAAL: Multimodal Variational Adversarial Active Learning for Downstream Medical Image Analysis Tasks Bidur Khanal et.al. 2306.12376v1 link
2023-06-21 One Policy to Dress Them All: Learning to Dress People with Diverse Poses and Garments Yufei Wang et.al. 2306.12372v1 null
2023-06-21 Attention Hybrid Variational Net for Accelerated MRI Reconstruction Guoyao Shen et.al. 2306.12365v1 null
2023-06-21 Linear and Non-Linear Barrier Coverage in Deterministic and Uncertain environment in WSNs: A New Classification Adda Boualem et.al. 2306.12355v1 null
2023-06-21 An efficient, provably exact algorithm for the 0-1 loss linear classification problem Xi He et.al. 2306.12344v1 null
2023-06-21 Geometric Pooling: maintaining more useful information Hao Xu et.al. 2306.12341v1 null
2023-06-22 Do you still need a manual smart contract audit? Isaac David et.al. 2306.12338v2 null
2023-06-22 Beyond Deep Ensembles: A Large-Scale Evaluation of Bayesian Deep Learning under Distribution Shift Florian Seligmann et.al. 2306.12306v2 link
2023-06-20 Segment Anything Model (SAM) for Radiation Oncology Lian Zhang et.al. 2306.11730v1 null
2023-06-20 Dense Video Object Captioning from Disjoint Supervision Xingyi Zhou et.al. 2306.11729v1 link
2023-06-20 How can objects help action recognition? Xingyi Zhou et.al. 2306.11726v1 link
2023-06-20 Low-complexity Multidimensional DCT Approximations V. A. Coutinho et.al. 2306.11724v1 null
2023-06-20 Meta-Analysis of Transfer Learning for Segmentation of Brain Lesions Sovesh Mohapatra et.al. 2306.11714v1 null
2023-06-20 Hexagonal circular 3-webs with polar curves of degree three Sergey I. Agafonov et.al. 2306.11707v1 null
2023-06-20 SkyGPT: Probabilistic Short-term Solar Forecasting Using Synthetic Sky Videos from Physics-constrained VideoGPT Yuhao Nie et.al. 2306.11682v1 null
2023-06-20 The Implicit Bias of Batch Normalization in Linear Models and Two-layer Linear Convolutional Neural Networks Yuan Cao et.al. 2306.11680v1 null
2023-06-20 A primal-dual data-driven method for computational optical imaging with a photonic lantern Carlos Santos Garcia et.al. 2306.11679v1 null
2023-06-20 Deep Learning Methods for Retinal Blood Vessel Segmentation: Evaluation on Images with Retinopathy of Prematurity Gorana Gojić et.al. 2306.11576v1 null
2023-06-16 Variational quantum algorithms for machine learning: theory and applications Stefano Mangini et.al. 2306.09984v1 null
2023-06-16 HePCo: Data-Free Heterogeneous Prompt Consolidation for Continual Federated Learning Shaunak Halbe et.al. 2306.09970v1 null
2023-06-16 Training shallow ReLU networks on noisy data using hinge loss: when do we overfit and is it benign? Erin George et.al. 2306.09955v1 null
2023-06-16 Towards Better Certified Segmentation via Diffusion Models Othmane Laousy et.al. 2306.09949v1 null
2023-06-16 Knowledge Distillation for Efficient Audio-Visual Video Captioning Özkan Çaylı et.al. 2306.09947v1 null
2023-06-16 RealImpact: A Dataset of Impact Sound Fields for Real Objects Samuel Clarke et.al. 2306.09944v1 null
2023-06-16 A classification of supersymmetric Kaluza-Klein black holes with a single axial symmetry David Katona et.al. 2306.09933v1 null
2023-06-16 A Metaheuristic-based Machine Learning Approach for Energy Prediction in Mobile App Development Seyed Jalaleddin Mousavirad et.al. 2306.09931v1 null
2023-06-16 Learning to Summarize and Answer Questions about a Virtual Robot's Past Actions Chad DeChant et.al. 2306.09922v1 null
2023-06-16 No Strong Feelings One Way or Another: Re-operationalizing Neutrality in Natural Language Inference Animesh Nighojkar et.al. 2306.09918v1 null
2023-06-16 UrbanIR: Large-Scale Urban Scene Inverse Rendering from a Single Video Zhi-Hao Lin et.al. 2306.09349v2 null
2023-06-15 Causal classification of spatiotemporal quantum correlations Minjeong Song et.al. 2306.09336v1 null
2023-06-15 Class-Conditional Conformal Prediction With Many Classes Tiffany Ding et.al. 2306.09335v1 link
2023-06-15 Personalized Image Enhancement Featuring Masked Style Modeling Satoshi Kosugi et.al. 2306.09334v1 link
2023-06-15 Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers Dominick Reilly et.al. 2306.09331v1 link
2023-06-15 WizMap: Scalable Interactive Visualization for Exploring Large Machine Learning Embeddings Zijie J. Wang et.al. 2306.09328v1 link
2023-06-15 Language-Guided Music Recommendation for Video via Prompt Analogies Daniel McKee et.al. 2306.09327v1 null
2023-06-15 Single-Stage Visual Query Localization in Egocentric Videos Hanwen Jiang et.al. 2306.09324v1 null
2023-06-15 Crowd-Powered Photo Enhancement Featuring an Active Learning Based Local Filter Satoshi Kosugi et.al. 2306.09321v1 link
2023-06-15 Learnable Weight Initialization for Volumetric Medical Image Segmentation Shahina Kunhimon et.al. 2306.09320v1 link
2023-06-13 Classification of branched Willmore spheres Dorian Martino et.al. 2306.07965v1 null
2023-06-13 Supervised-Contrastive Loss Learns Orthogonal Frames and Batching Matters Ganesh Ramachandra Kini et.al. 2306.07960v1 link
2023-06-13 Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation Shuai Yang et.al. 2306.07954v1 null
2023-06-13 MOFI: Learning Image Representations from Noisy Entity Annotated Images Wentao Wu et.al. 2306.07952v1 null
2023-06-13 Image Captioners Are Scalable Vision Learners Too Michael Tschannen et.al. 2306.07915v1 null
2023-06-13 Massively Multilingual Corpus of Sentiment Datasets and Multi-faceted Sentiment Classification Benchmark Łukasz Augustyniak et.al. 2306.07902v1 link
2023-06-13 Artificial Artificial Artificial Intelligence: Crowd Workers Widely Use Large Language Models for Text Production Tasks Veniamin Veselovsky et.al. 2306.07899v1 link
2023-06-13 CAMEO: A Causal Transfer Learning Approach for Performance Optimization of Configurable Computer Systems Md Shahriar Iqbal et.al. 2306.07888v1 null
2023-06-13 Deep Learning-Enabled Zero-Touch Device Identification: Mitigating the Impact of Channel Variability Through MIMO Diversity Bechir Hamdaoui et.al. 2306.07878v1 null
2023-06-13 On the flow unsteadiness and operational characteristics of a novel supersonic fluidic oscillator Spandan Maikap et.al. 2306.07849v1 null
2023-06-12 Waffling around for Performance: Visual Classification with Random Words and Broad Concepts Karsten Roth et.al. 2306.07282v1 link
2023-06-12 The Cheltsov--Rubinstein problem for strongly asymptotically log del Pezzo surfaces Chenzi Jin et.al. 2306.07278v1 null
2023-06-12 Gaussian Membership Inference Privacy Tobias Leemann et.al. 2306.07273v1 null
2023-06-12 MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images Junchen Zhu et.al. 2306.07257v1 null
2023-06-12 On the Expected Size of Conformal Prediction Sets Guneet S. Dhillon et.al. 2306.07254v1 null
2023-06-12 RB-Dust -- A Reference-based Dataset for Vision-based Dust Removal Peter Buckel et.al. 2306.07244v1 null
2023-06-12 Strokes2Surface: Recovering Curve Networks From 4D Architectural Design Sketches S. Rasoulzadeh et.al. 2306.07220v1 null
2023-06-12 Cyclic objects from surfaces Ivan Bartulović et.al. 2306.07216v1 null
2023-06-12 Valley: Video Assistant with Large Language model Enhanced abilitY Ruipu Luo et.al. 2306.07207v1 null
2023-06-12 A Survey of Vision-Language Pre-training from the Lens of Multimodal Machine Translation Jeremy Gwinnup et.al. 2306.07198v1 null
2023-06-09 Shock Cooling and Possible Precursor Emission in the Early Light Curve of the Type II SN 2023ixf Griffin Hosseinzadeh et.al. 2306.06097v1 null
2023-06-09 Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding Mu Cai et.al. 2306.06094v1 null
2023-06-09 Virtual Node Tuning for Few-shot Node Classification Zhen Tan et.al. 2306.06063v1 null
2023-06-09 Ion-Driven Instabilities in the Inner Heliosphere II: Classification and Multi-Dimensional Mapping Mihailo M. Martinovic et.al. 2306.06060v1 null
2023-06-09 Exploring the Impact of Image Resolution on Chest X-ray Classification Performance Alessandro Wollek et.al. 2306.06051v1 null
2023-06-09 How Does Fine-Tuning Impact Out-of-Distribution Detection for Vision-Language Models? Yifei Ming et.al. 2306.06048v1 null
2023-06-09 GANeRF: Leveraging Discriminators to Optimize Neural Radiance Fields Barbara Roessle et.al. 2306.06044v1 null
2023-06-09 WindowNet: Learnable Windows for Chest X-ray Classification Alessandro Wollek et.al. 2306.06038v1 null
2023-06-09 Benchmarking self-supervised video representation learning Akash Kumar et.al. 2306.06010v1 null
2023-06-09 Beyond Detection: Visual Realism Assessment of Deepfakes Luka Dragar et.al. 2306.05985v1 null
2023-06-08 MIMIC-IT: Multi-Modal In-Context Instruction Tuning Bo Li et.al. 2306.05425v1 link
2023-06-08 Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models Muhammad Maaz et.al. 2306.05424v1 link
2023-06-08 ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process Changyao Tian et.al. 2306.05423v1 null
2023-06-08 Tracking Everything Everywhere All at Once Qianqian Wang et.al. 2306.05422v1 null
2023-06-08 2D Supervised Monocular 3D Object Detection by Global-to-Local 3D Reconstruction Jiawei He et.al. 2306.05418v1 null
2023-06-08 Tracking Objects with 3D Representation from Videos Jiawei He et.al. 2306.05416v1 null
2023-06-08 Quantum symmetries in 2+1 dimensions: Carroll, (a)dS-Carroll, Galilei and (a)dS-Galilei Tomasz Trześniewski et.al. 2306.05409v1 null
2023-06-08 Deformation theory for prismatic $G$-displays Kazuhiro Ito et.al. 2306.05361v1 null
2023-06-08 Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models Nan Liu et.al. 2306.05357v1 null
2023-06-08 Predictive Modeling of Equine Activity Budgets Using a 3D Skeleton Reconstructed from Surveillance Recordings Ernest Pokropek et.al. 2306.05311v1 null
2023-06-08 Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt Kai Chen et.al. 2306.04607v2 null
2023-06-07 MarineVRS: Marine Video Retrieval System with Explainability via Semantic Understanding Tan-Sang Ha et.al. 2306.04593v1 null
2023-06-07 A Dataset for Deep Learning-based Bone Structure Analyses in Total Hip Arthroplasty Kaidong Zhang et.al. 2306.04579v1 link
2023-06-07 ChatGPT is fun, but it is not funny! Humor is still challenging Large Language Models Sophie Jentzsch et.al. 2306.04563v1 link
2023-06-07 Contrastive Bootstrapping for Label Refinement Shudi Hou et.al. 2306.04544v1 null
2023-06-07 Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications Paul Pu Liang et.al. 2306.04539v1 link
2023-06-07 Long-form analogies generated by chatGPT lack human-like psycholinguistic properties S. M. Seals et.al. 2306.04537v1 null
2023-06-07 ContriMix: Unsupervised disentanglement of content and attribute for domain generalization in microscopy image analysis Tan H. Nguyen et.al. 2306.04527v1 null
2023-06-07 Cross-attention learning enables real-time nonuniform rotational distortion correction in OCT Haoran Zhang et.al. 2306.04512v1 null
2023-06-07 Hardness of Deceptive Certificate Selection Stephan Wäldchen et.al. 2306.04505v1 null
2023-06-06 CL-UZH at SemEval-2023 Task 10: Sexism Detection through Incremental Fine-Tuning and Multi-Task Learning with Label Descriptions Janis Goldzycher et.al. 2306.03907v1 null
2023-06-06 Utterance Classification with Logical Neural Network: Explainable AI for Mental Disorder Diagnosis Yeldar Toleubay et.al. 2306.03902v1 null
2023-06-06 Towards Label-free Scene Understanding by Vision Foundation Models Runnan Chen et.al. 2306.03899v1 null
2023-06-06 Multi-Label ECG Classification using Temporal Convolutional Neural Network Eedara Prabhakararao et.al. 2306.03844v1 null
2023-06-06 Atrial Septal Defect Detection in Children Based on Ultrasound Video Using Multiple Instances Learning Yiman Liu et.al. 2306.03835v1 null
2023-06-06 MTS2Graph: Interpretable Multivariate Time Series Classification with Temporal Evolving Graphs Raneen Younis et.al. 2306.03834v1 null
2023-06-06 GEO-Bench: Toward Foundation Models for Earth Monitoring Alexandre Lacoste et.al. 2306.03831v1 link
2023-06-06 Quick-Tune: Quickly Learning Which Pretrained Model to Finetune and How Sebastian Pineda Arango et.al. 2306.03828v1 null
2023-06-06 Learning to Ground Instructional Articles in Videos through Narrations Effrosyni Mavroudi et.al. 2306.03802v1 null
2023-06-06 Matched Pair Calibration for Ranking Fairness Hannah Korevaar et.al. 2306.03775v1 null
2023-06-05 Neuralangelo: High-Fidelity Neural Surface Reconstruction Zhaoshuo Li et.al. 2306.03092v1 null
2023-06-05 Dismantling Hate: Understanding Hate Speech Trends Against NBA Athletes Edinam Kofi Klutse et.al. 2306.03086v1 null
2023-06-05 MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion Chiyu Max Jiang et.al. 2306.03083v1 null
2023-06-05 **Of Mice and Mates: Automated Classification and Modelling of Mouse

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages