Skip to main content

Showing 1–50 of 296 results for author: Tang, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.12784  [pdf, other

    cs.AI cs.CL cs.LG

    JudgeBench: A Benchmark for Evaluating LLM-based Judges

    Authors: Sijun Tan, Siyuan Zhuang, Kyle Montgomery, William Y. Tang, Alejandro Cuadron, Chenguang Wang, Raluca Ada Popa, Ion Stoica

    Abstract: LLM-based judges have emerged as a scalable alternative to human evaluation and are increasingly used to assess, compare, and improve models. However, the reliability of LLM-based judges themselves is rarely scrutinized. As LLMs become more advanced, their responses grow more sophisticated, requiring stronger judges to evaluate them. Existing benchmarks primarily focus on a judge's alignment with… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: preprint

  2. arXiv:2410.11241  [pdf, other

    cs.CV

    Learning Diffusion Model from Noisy Measurement using Principled Expectation-Maximization Method

    Authors: Weimin Bai, Weiheng Tang, Enze Ye, Siyi Chen, Wenzheng Chen, He Sun

    Abstract: Diffusion models have demonstrated exceptional ability in modeling complex image distributions, making them versatile plug-and-play priors for solving imaging inverse problems. However, their reliance on large-scale clean datasets for training limits their applicability in scenarios where acquiring clean data is costly or impractical. Recent approaches have attempted to learn diffusion models dire… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  3. arXiv:2410.11124  [pdf, other

    cs.CV cs.LG stat.AP

    Real-Time Localization and Bimodal Point Pattern Analysis of Palms Using UAV Imagery

    Authors: Kangning Cui, Wei Tang, Rongkun Zhu, Manqi Wang, Gregory D. Larsen, Victor P. Pauca, Sarra Alqahtani, Fan Yang, David Segurado, Paul Fine, Jordan Karubian, Raymond H. Chan, Robert J. Plemmons, Jean-Michel Morel, Miles R. Silman

    Abstract: Understanding the spatial distribution of palms within tropical forests is essential for effective ecological monitoring, conservation strategies, and the sustainable integration of natural forest products into local and global supply chains. However, the analysis of remotely sensed data in these environments faces significant challenges, such as overlapping palm and tree crowns, uneven shading ac… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: 25 pages, 8 figures, 5 tables

  4. arXiv:2410.08877  [pdf, other

    cs.LG cs.DB cs.IR cs.MM

    Interdependency Matters: Graph Alignment for Multivariate Time Series Anomaly Detection

    Authors: Yuanyi Wang, Haifeng Sun, Chengsen Wang, Mengde Zhu, Jingyu Wang, Wei Tang, Qi Qi, Zirui Zhuang, Jianxin Liao

    Abstract: Anomaly detection in multivariate time series (MTS) is crucial for various applications in data mining and industry. Current industrial methods typically approach anomaly detection as an unsupervised learning task, aiming to identify deviations by estimating the normal distribution in noisy, label-free datasets. These methods increasingly incorporate interdependencies between channels through grap… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  5. arXiv:2410.06866  [pdf, other

    cs.CV eess.IV

    Secure Video Quality Assessment Resisting Adversarial Attacks

    Authors: Ao-Xiang Zhang, Yu Ran, Weixuan Tang, Yuan-Gen Wang, Qingxiao Guan, Chunsheng Yang

    Abstract: The exponential surge in video traffic has intensified the imperative for Video Quality Assessment (VQA). Leveraging cutting-edge architectures, current VQA models have achieved human-comparable accuracy. However, recent studies have revealed the vulnerability of existing VQA models against adversarial attacks. To establish a reliable and practical assessment system, a secure VQA model capable of… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  6. arXiv:2410.05115  [pdf, other

    quant-ph cs.AI eess.SY

    AlphaRouter: Quantum Circuit Routing with Reinforcement Learning and Tree Search

    Authors: Wei Tang, Yiheng Duan, Yaroslav Kharkov, Rasool Fakoor, Eric Kessler, Yunong Shi

    Abstract: Quantum computers have the potential to outperform classical computers in important tasks such as optimization and number factoring. They are characterized by limited connectivity, which necessitates the routing of their computational bits, known as qubits, to specific locations during program execution to carry out quantum operations. Traditionally, the NP-hard optimization problem of minimizing… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

    Comments: 11 pages, 11 figures, International Conference on Quantum Computing and Engineering - QCE24

  7. arXiv:2410.04203  [pdf, other

    cs.AI

    RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization

    Authors: Hanyang Zhao, Genta Indra Winata, Anirban Das, Shi-Xiong Zhang, David D. Yao, Wenpin Tang, Sambit Sahu

    Abstract: Recently, numerous preference optimization algorithms have been introduced as extensions to the Direct Preference Optimization (DPO) family. While these methods have successfully aligned models with human preferences, there is a lack of understanding regarding the contributions of their additional components. Moreover, fair and consistent comparisons are scarce, making it difficult to discern whic… ▽ More

    Submitted 5 October, 2024; originally announced October 2024.

  8. arXiv:2410.03741  [pdf, other

    cs.HC cs.AI

    Towards Democratization of Subspeciality Medical Expertise

    Authors: Jack W. O'Sullivan, Anil Palepu, Khaled Saab, Wei-Hung Weng, Yong Cheng, Emily Chu, Yaanik Desai, Aly Elezaby, Daniel Seung Kim, Roy Lan, Wilson Tang, Natalie Tapaskar, Victoria Parikh, Sneha S. Jain, Kavita Kulkarni, Philip Mansfield, Dale Webster, Juraj Gottweis, Joelle Barral, Mike Schaekermann, Ryutaro Tanno, S. Sara Mahdavi, Vivek Natarajan, Alan Karthikesalingam, Euan Ashley , et al. (1 additional authors not shown)

    Abstract: The scarcity of subspecialist medical expertise, particularly in rare, complex and life-threatening diseases, poses a significant challenge for healthcare delivery. This issue is particularly acute in cardiology where timely, accurate management determines outcomes. We explored the potential of AMIE (Articulate Medical Intelligence Explorer), a large language model (LLM)-based experimental AI syst… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

  9. arXiv:2410.03509  [pdf, other

    cs.RO

    GAP-RL: Grasps As Points for RL Towards Dynamic Object Grasping

    Authors: Pengwei Xie, Siang Chen, Qianrun Chen, Wei Tang, Dingchang Hu, Yixiang Dai, Rui Chen, Guijin Wang

    Abstract: Dynamic grasping of moving objects in complex, continuous motion scenarios remains challenging. Reinforcement Learning (RL) has been applied in various robotic manipulation tasks, benefiting from its closed-loop property. However, existing RL-based methods do not fully explore the potential for enhancing visual representations. In this letter, we propose a novel framework called Grasps As Points f… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: Accepted by RA-L for further publication, may be unavailable or updated in the future

  10. arXiv:2410.02551  [pdf, other

    cs.LG cs.AI cs.CL

    ColaCare: Enhancing Electronic Health Record Modeling through Large Language Model-Driven Multi-Agent Collaboration

    Authors: Zixiang Wang, Yinghao Zhu, Huiya Zhao, Xiaochen Zheng, Tianlong Wang, Wen Tang, Yasha Wang, Chengwei Pan, Ewen M. Harrison, Junyi Gao, Liantao Ma

    Abstract: We introduce ColaCare, a framework that enhances Electronic Health Record (EHR) modeling through multi-agent collaboration driven by Large Language Models (LLMs). Our approach seamlessly integrates domain-specific expert models with LLMs to bridge the gap between structured EHR data and text-based reasoning. Inspired by clinical consultations, ColaCare employs two types of agents: DoctorAgent and… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

  11. arXiv:2409.19961  [pdf, other

    cs.CV cs.CL

    Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval

    Authors: Yabing Wang, Le Wang, Qiang Zhou, Zhibin Wang, Hao Li, Gang Hua, Wei Tang

    Abstract: Cross-lingual cross-modal retrieval (CCR) aims to retrieve visually relevant content based on non-English queries, without relying on human-labeled cross-modal data pairs during training. One popular approach involves utilizing machine translation (MT) to create pseudo-parallel data pairs, establishing correspondence between visual and non-English textual data. However, aligning their representati… ▽ More

    Submitted 30 September, 2024; originally announced September 2024.

    Comments: Accepted by ACM Multimedia

  12. arXiv:2409.18269  [pdf, ps, other

    cs.GT

    Intrinsic Robustness of Prophet Inequality to Strategic Reward Signaling

    Authors: Wei Tang, Haifeng Xu, Ruimin Zhang, Derek Zhu

    Abstract: Prophet inequality concerns a basic optimal stopping problem and states that simple threshold stopping policies -- i.e., accepting the first reward larger than a certain threshold -- can achieve tight $\frac{1}{2}$-approximation to the optimal prophet value. Motivated by its economic applications, this paper studies the robustness of this approximation to natural strategic manipulations in which e… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

  13. arXiv:2409.15866  [pdf, other

    cs.RO cs.LG

    Multi-UAV Pursuit-Evasion with Online Planning in Unknown Environments by Deep Reinforcement Learning

    Authors: Jiayu Chen, Chao Yu, Guosheng Li, Wenhao Tang, Xinyi Yang, Botian Xu, Huazhong Yang, Yu Wang

    Abstract: Multi-UAV pursuit-evasion, where pursuers aim to capture evaders, poses a key challenge for UAV swarm intelligence. Multi-agent reinforcement learning (MARL) has demonstrated potential in modeling cooperative behaviors, but most RL-based approaches remain constrained to simplified simulations with limited dynamics or fixed scenarios. Previous attempts to deploy RL policy to real-world pursuit-evas… ▽ More

    Submitted 25 September, 2024; v1 submitted 24 September, 2024; originally announced September 2024.

  14. arXiv:2409.13262  [pdf, other

    cs.CL cs.SD eess.AS

    Large Language Model Should Understand Pinyin for Chinese ASR Error Correction

    Authors: Yuang Li, Xiaosong Qiao, Xiaofeng Zhao, Huan Zhao, Wei Tang, Min Zhang, Hao Yang

    Abstract: Large language models can enhance automatic speech recognition systems through generative error correction. In this paper, we propose Pinyin-enhanced GEC, which leverages Pinyi, the phonetic representation of Mandarin Chinese, as supplementary information to improve Chinese ASR error correction. Our approach only utilizes synthetic errors for training and employs the one-best hypothesis during inf… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

  15. arXiv:2409.11564  [pdf, other

    cs.CL cs.AI cs.CV cs.LG eess.AS

    Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey

    Authors: Genta Indra Winata, Hanyang Zhao, Anirban Das, Wenpin Tang, David D. Yao, Shi-Xiong Zhang, Sambit Sahu

    Abstract: Preference tuning is a crucial process for aligning deep generative models with human preferences. This survey offers a thorough overview of recent advancements in preference tuning and the integration of human feedback. The paper is organized into three main sections: 1) introduction and preliminaries: an introduction to reinforcement learning frameworks, preference tuning tasks, models, and data… ▽ More

    Submitted 17 September, 2024; originally announced September 2024.

    Comments: Survey paper

  16. arXiv:2409.10032  [pdf, other

    cs.RO

    Embodiment-Agnostic Action Planning via Object-Part Scene Flow

    Authors: Weiliang Tang, Jia-Hui Pan, Wei Zhan, Jianshu Zhou, Huaxiu Yao, Yun-Hui Liu, Masayoshi Tomizuka, Mingyu Ding, Chi-Wing Fu

    Abstract: Observing that the key for robotic action planning is to understand the target-object motion when its associated part is manipulated by the end effector, we propose to generate the 3D object-part scene flow and extract its transformations to solve the action trajectories for diverse embodiments. The advantage of our approach is that it derives the robot action explicitly from object motion predict… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: 8 pages, 7 figures

  17. arXiv:2409.08487  [pdf, other

    cs.LG cs.AI stat.ML

    Sub-graph Based Diffusion Model for Link Prediction

    Authors: Hang Li, Wei Jin, Geri Skenderi, Harry Shomer, Wenzhuo Tang, Wenqi Fan, Jiliang Tang

    Abstract: Denoising Diffusion Probabilistic Models (DDPMs) represent a contemporary class of generative models with exceptional qualities in both synthesis and maximizing the data likelihood. These models work by traversing a forward Markov Chain where data is perturbed, followed by a reverse process where a neural network learns to undo the perturbations and recover the original data. There have been incre… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

    Comments: 17 pages, 3 figures

  18. arXiv:2409.08400  [pdf, ps, other

    cs.LG cs.AI

    Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning

    Authors: Hanyang Zhao, Haoxian Chen, Ji Zhang, David D. Yao, Wenpin Tang

    Abstract: Reinforcement Learning from human feedback (RLHF) has been shown a promising direction for aligning generative models with human intent and has also been explored in recent works for alignment of diffusion generative models. In this work, we provide a rigorous treatment by formulating the task of fine-tuning diffusion models, with reward functions learned from human feedback, as an exploratory con… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

  19. arXiv:2409.06105  [pdf, other

    cs.CV

    SGC-VQGAN: Towards Complex Scene Representation via Semantic Guided Clustering Codebook

    Authors: Chenjing Ding, Chiyu Wang, Boshi Liu, Xi Guo, Weixuan Tang, Wei Wu

    Abstract: Vector quantization (VQ) is a method for deterministically learning features through discrete codebook representations. Recent works have utilized visual tokenizers to discretize visual regions for self-supervised representation learning. However, a notable limitation of these tokenizers is lack of semantics, as they are derived solely from the pretext task of reconstructing raw image pixels in an… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

  20. arXiv:2409.06104  [pdf, other

    cs.CV

    LSE-NeRF: Learning Sensor Modeling Errors for Deblured Neural Radiance Fields with RGB-Event Stereo

    Authors: Wei Zhi Tang, Daniel Rebain, Kostantinos G. Derpanis, Kwang Moo Yi

    Abstract: We present a method for reconstructing a clear Neural Radiance Field (NeRF) even with fast camera motions. To address blur artifacts, we leverage both (blurry) RGB images and event camera data captured in a binocular configuration. Importantly, when reconstructing our clear NeRF, we consider the camera modeling imperfections that arise from the simple pinhole camera model as learned embeddings for… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

  21. arXiv:2409.05463  [pdf, other

    cs.CV

    DriveScape: Towards High-Resolution Controllable Multi-View Driving Video Generation

    Authors: Wei Wu, Xi Guo, Weixuan Tang, Tingxuan Huang, Chiyu Wang, Dongyue Chen, Chenjing Ding

    Abstract: Recent advancements in generative models have provided promising solutions for synthesizing realistic driving videos, which are crucial for training autonomous driving perception models. However, existing approaches often struggle with multi-view video generation due to the challenges of integrating 3D information while maintaining spatial-temporal consistency and effectively learning from a unifi… ▽ More

    Submitted 12 September, 2024; v1 submitted 9 September, 2024; originally announced September 2024.

    Comments: Homepage: https://metadrivescape.github.io/papers_project/drivescapev1/index.html

  22. arXiv:2408.16469  [pdf, other

    cs.CV

    Multi-source Domain Adaptation for Panoramic Semantic Segmentation

    Authors: Jing Jiang, Sicheng Zhao, Jiankun Zhu, Wenbo Tang, Zhaopan Xu, Jidong Yang, Pengfei Xu, Hongxun Yao

    Abstract: Panoramic semantic segmentation has received widespread attention recently due to its comprehensive 360\degree field of view. However, labeling such images demands greater resources compared to pinhole images. As a result, many unsupervised domain adaptation methods for panoramic semantic segmentation have emerged, utilizing real pinhole images or low-cost synthetic panoramic images. But, the segm… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: 9 pages, 7 figures, 5 tables

  23. arXiv:2408.14764  [pdf, other

    cs.CV cs.MM

    SynthDoc: Bilingual Documents Synthesis for Visual Document Understanding

    Authors: Chuanghao Ding, Xuejing Liu, Wei Tang, Juan Li, Xiaoliang Wang, Rui Zhao, Cam-Tu Nguyen, Fei Tan

    Abstract: This paper introduces SynthDoc, a novel synthetic document generation pipeline designed to enhance Visual Document Understanding (VDU) by generating high-quality, diverse datasets that include text, images, tables, and charts. Addressing the challenges of data acquisition and the limitations of existing datasets, SynthDoc leverages publicly available corpora and advanced rendering tools to create… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

  24. arXiv:2408.14369  [pdf, other

    cs.LG

    Exploiting Conjugate Label Information for Multi-Instance Partial-Label Learning

    Authors: Wei Tang, Weijia Zhang, Min-Ling Zhang

    Abstract: Multi-instance partial-label learning (MIPL) addresses scenarios where each training sample is represented as a multi-instance bag associated with a candidate label set containing one true label and several false positives. Existing MIPL algorithms have primarily focused on mapping multi-instance bags to candidate label sets for disambiguation, disregarding the intrinsic properties of the label sp… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: Accepted at IJCAI 2024. The code can be found at https://github.com/tangw-seu/ELIMIPL

  25. arXiv:2408.12069  [pdf, other

    cs.IT eess.SP

    Rotatable Block-Controlled RIS: Bridging the Performance Gap to Element-Controlled Systems

    Authors: Weicong Chen, Xinyi Yang, Chao-Kai Wen, Wankai Tang, Jinghe Wang, Yifei Yuan, Xiao Li, Shi Jin

    Abstract: The passive reconfigurable intelligent surface (RIS) requires numerous elements to achieve adequate array gain, which linearly increases power consumption (PC) with the number of reflection phases. To address this, this letter introduces a rotatable block-controlled RIS (BC-RIS) that preserves spectral efficiency (SE) while reducing power costs. Unlike the element-controlled RIS (EC-RIS), which ne… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  26. arXiv:2408.11631  [pdf, other

    cs.SE

    Uncovering and Mitigating the Impact of Frozen Package Versions for Fixed-Release Linux

    Authors: Wei Tang, Zhengzi Xu, Chengwei Liu, Ping Luo, Yang Liu

    Abstract: Towards understanding the ecosystem gap of fixed-release Linux that is caused by the evolution of mirrors, we conducted a comprehensive study of the Debian ecosystem. This study involved the collection of Debian packages and the construction of the dependency graph of the Debian ecosystem. Utilizing historic snapshots of Debian mirrors, we were able to recover the evolution of the dependency graph… ▽ More

    Submitted 11 September, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

  27. arXiv:2408.07422  [pdf, other

    cs.CV cs.AI

    LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image

    Authors: Fan Yang, Sicheng Zhao, Yanhao Zhang, Haoxiang Chen, Hui Chen, Wenbo Tang, Haonan Lu, Pengfei Xu, Zhenyu Yang, Jungong Han, Guiguang Ding

    Abstract: Recent advancements in autonomous driving, augmented reality, robotics, and embodied intelligence have necessitated 3D perception algorithms. However, current 3D perception methods, particularly small models, struggle with processing logical reasoning, question-answering, and handling open scenario categories. On the other hand, generative multimodal large language models (MLLMs) excel in general… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

  28. arXiv:2408.02839  [pdf, other

    stat.ML cs.LG

    Optimizing Cox Models with Stochastic Gradient Descent: Theoretical Foundations and Practical Guidances

    Authors: Lang Zeng, Weijing Tang, Zhao Ren, Ying Ding

    Abstract: Optimizing Cox regression and its neural network variants poses substantial computational challenges in large-scale studies. Stochastic gradient descent (SGD), known for its scalability in model optimization, has recently been adapted to optimize Cox models. Unlike its conventional application, which typically targets a sum of independent individual loss, SGD for Cox models updates parameters base… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

  29. arXiv:2408.01668  [pdf, other

    cs.CV cs.MM

    Multiple Contexts and Frequencies Aggregation Network forDeepfake Detection

    Authors: Zifeng Li, Wenzhong Tang, Shijun Gao, Shuai Wang, Yanxiang Wang

    Abstract: Deepfake detection faces increasing challenges since the fast growth of generative models in developing massive and diverse Deepfake technologies. Recent advances rely on introducing heuristic features from spatial or frequency domains rather than modeling general forgery features within backbones. To address this issue, we turn to the backbone design with two intuitive priors from spatial and fre… ▽ More

    Submitted 3 August, 2024; originally announced August 2024.

  30. HAIGEN: Towards Human-AI Collaboration for Facilitating Creativity and Style Generation in Fashion Design

    Authors: Jianan Jiang, Di Wu, Hanhui Deng, Yidan Long, Wenyi Tang, Xiang Li, Can Liu, Zhanpeng Jin, Wenlei Zhang, Tangquan Qi

    Abstract: The process of fashion design usually involves sketching, refining, and coloring, with designers drawing inspiration from various images to fuel their creative endeavors. However, conventional image search methods often yield irrelevant results, impeding the design process. Moreover, creating and coloring sketches can be time-consuming and demanding, acting as a bottleneck in the design workflow.… ▽ More

    Submitted 30 September, 2024; v1 submitted 1 August, 2024; originally announced August 2024.

    Comments: Accepted by Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (ACM IMWUT/UbiComp 2024)

  31. arXiv:2407.17689  [pdf, other

    cs.CV

    SAM-MIL: A Spatial Contextual Aware Multiple Instance Learning Approach for Whole Slide Image Classification

    Authors: Heng Fang, Sheng Huang, Wenhao Tang, Luwen Huangfu, Bo Liu

    Abstract: Multiple Instance Learning (MIL) represents the predominant framework in Whole Slide Image (WSI) classification, covering aspects such as sub-typing, diagnosis, and beyond. Current MIL models predominantly rely on instance-level features derived from pretrained models such as ResNet. These models segment each WSI into independent patches and extract features from these local patches, leading to a… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: accepted by ACM Multimedia 2024

  32. arXiv:2407.11816  [pdf, other

    cs.PL

    Modal Effect Types

    Authors: Wenhao Tang, Leo White, Stephen Dolan, Daniel Hillerstr�m, Sam Lindley, Anton Lorenzen

    Abstract: We propose a novel type system for effects and handlers using modal types. Conventional effect systems attach effects to function types, which can lead to verbose effect-polymorphic types, especially for higher-order functions. Our modal effect system provides succinct types for higher-order first-class functions without losing modularity and reusability. The core idea is to decouple effects from… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 76 pages

  33. arXiv:2407.11553  [pdf, other

    eess.SP cs.AI

    Learning Global and Local Features of Power Load Series Through Transformer and 2D-CNN: An Image-based Multi-step Forecasting Approach Incorporating Phase Space Reconstruction

    Authors: Zihan Tang, Tianyao Ji, Wenhu Tang

    Abstract: As modern power systems continue to evolve, accurate power load forecasting remains a critical issue in energy management. The phase space reconstruction method can effectively retain the inner chaotic property of power load from a system dynamics perspective and thus is a promising knowledge-based preprocessing method for short-term forecasting. In order to fully utilize the capability of PSR met… ▽ More

    Submitted 28 July, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

  34. arXiv:2407.05434  [pdf, other

    cs.CL cs.AI

    LTLBench: Towards Benchmarks for Evaluating Temporal Logic Reasoning in Large Language Models

    Authors: Weizhi Tang, Vaishak Belle

    Abstract: Temporal reasoning (TR) is a critical component of artificial intelligence, encompassing understanding and processing temporal information and relationships between events. To discover and study the TR ability in Large Language Models (LLMs), various datasets have been constructed in different ways for evaluating various aspects of TR ability. Our work proposes a novel approach to design and devel… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  35. arXiv:2407.00497  [pdf, other

    cs.CL

    LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement

    Authors: Jiahao Ying, Mingbao Lin, Yixin Cao, Wei Tang, Bo Wang, Qianru Sun, Xuanjing Huang, Shuicheng Yan

    Abstract: This paper introduces the innovative "LLMs-as-Instructors" framework, which leverages the advanced Large Language Models (LLMs) to autonomously enhance the training of smaller target models. Inspired by the theory of "Learning from Errors", this framework employs an instructor LLM to meticulously analyze the specific errors within a target model, facilitating targeted and efficient training cycles… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  36. arXiv:2407.00394  [pdf

    physics.plasm-ph cs.DC cs.PF physics.comp-ph

    Understanding Large-Scale Plasma Simulation Challenges for Fusion Energy on Supercomputers

    Authors: Jeremy J. Williams, Ashish Bhole, Dylan Kierans, Matthias Hoelzl, Ihor Holod, Weikang Tang, David Tskhakaya, Stefan Costea, Leon Kos, Ales Podolnik, Jakub Hromadka, JOREK Team, Erwin Laure, Stefano Markidis

    Abstract: Understanding plasma instabilities is essential for achieving sustainable fusion energy, with large-scale plasma simulations playing a crucial role in both the design and development of next-generation fusion energy devices and the modelling of industrial plasmas. To achieve sustainable fusion energy, it is essential to accurately model and predict plasma behavior under extreme conditions, requiri… ▽ More

    Submitted 30 July, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

    Comments: Accepted by EPS PLASMA 2024 (50th European Physical Society Conference on Plasma Physics, Vol. 48A, ISBN: 111-22-33333-44-5), prepared in the standardized EPS conference proceedings format and consists of 4 pages, which includes the main text, references, and figures

  37. arXiv:2406.13167  [pdf, other

    cs.CL

    QRMeM: Unleash the Length Limitation through Question then Reflection Memory Mechanism

    Authors: Bo Wang, Heyan Huang, Yixin Cao, Jiahao Ying, Wei Tang, Chong Feng

    Abstract: While large language models (LLMs) have made notable advancements in natural language processing, they continue to struggle with processing extensive text. Memory mechanism offers a flexible solution for managing long contexts, utilizing techniques such as compression, summarization, and structuring to facilitate nuanced and efficient handling of large volumes of text. However, existing techniques… ▽ More

    Submitted 26 September, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: EMNLP 2024 Findings

  38. arXiv:2406.10928  [pdf, other

    cs.CR cs.AI cs.NI

    Make Your Home Safe: Time-aware Unsupervised User Behavior Anomaly Detection in Smart Homes via Loss-guided Mask

    Authors: Jingyu Xiao, Zhiyao Xu, Qingsong Zou, Qing Li, Dan Zhao, Dong Fang, Ruoyu Li, Wenxin Tang, Kang Li, Xudong Zuo, Penghui Hu, Yong Jiang, Zixuan Weng, Michael R. Lyv

    Abstract: Smart homes, powered by the Internet of Things, offer great convenience but also pose security concerns due to abnormal behaviors, such as improper operations of users and potential attacks from malicious attackers. Several behavior modeling methods have been proposed to identify abnormal behaviors and mitigate potential risks. However, their performance often falls short because they do not effec… ▽ More

    Submitted 18 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: KDD 2024

  39. arXiv:2406.10831  [pdf, other

    cs.NI cs.AI cs.DC

    Design and Optimization of Hierarchical Gradient Coding for Distributed Learning at Edge Devices

    Authors: Weiheng Tang, Jingyi Li, Lin Chen, Xu Chen

    Abstract: Edge computing has recently emerged as a promising paradigm to boost the performance of distributed learning by leveraging the distributed resources at edge nodes. Architecturally, the introduction of edge nodes adds an additional intermediate layer between the master and workers in the original distributed learning systems, potentially leading to more severe straggler effect. Recently, coding the… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: The paper has been accepted by IEEE Transactions on Communications

  40. arXiv:2406.04800  [pdf, other

    cs.AI cs.CL

    Zero, Finite, and Infinite Belief History of Theory of Mind Reasoning in Large Language Models

    Authors: Weizhi Tang, Vaishak Belle

    Abstract: Large Language Models (LLMs) have recently shown a promise and emergence of Theory of Mind (ToM) ability and even outperform humans in certain ToM tasks. To evaluate and extend the boundaries of the ToM reasoning ability of LLMs, we propose a novel concept, taxonomy, and framework, the ToM reasoning with Zero, Finite, and Infinite Belief History and develop a multi-round text-based game, called… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  41. arXiv:2406.03963  [pdf, other

    cs.CL

    A + B: A General Generator-Reader Framework for Optimizing LLMs to Unleash Synergy Potential

    Authors: Wei Tang, Yixin Cao, Jiahao Ying, Bo Wang, Yuyue Zhao, Yong Liao, Pengyuan Zhou

    Abstract: Retrieval-Augmented Generation (RAG) is an effective solution to supplement necessary knowledge to large language models (LLMs). Targeting its bottleneck of retriever performance, "generate-then-read" pipeline is proposed to replace the retrieval stage with generation from the LLM itself. Although promising, this research direction is underexplored and still cannot work in the scenario when source… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL'24 (Findings)

  42. arXiv:2406.03616  [pdf, other

    stat.ML cs.LG

    BEACON: A Bayesian Optimization Strategy for Novelty Search in Expensive Black-Box Systems

    Authors: Wei-Ting Tang, Ankush Chakrabarty, Joel A. Paulson

    Abstract: Novelty search (NS) refers to a class of exploration algorithms that automatically uncover diverse system behaviors through simulations or experiments. Systematically obtaining diverse outcomes is a key component in many real-world design problems such as material and drug discovery, neural architecture search, reinforcement learning, and robot navigation. Since the relationship between the inputs… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  43. arXiv:2406.01899  [pdf, other

    cs.LG

    Cross-Domain Graph Data Scaling: A Showcase with Diffusion Models

    Authors: Wenzhuo Tang, Haitao Mao, Danial Dervovic, Ivan Brugere, Saumitra Mishra, Yuying Xie, Jiliang Tang

    Abstract: Models for natural language and images benefit from data scaling behavior: the more data fed into the model, the better they perform. This 'better with more' phenomenon enables the effectiveness of large-scale pre-training on vast amounts of data. However, current graph pre-training methods struggle to scale up data due to heterogeneity across graphs. To achieve effective data scaling, we aim to d… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  44. arXiv:2406.01767  [pdf, other

    cs.RO

    Region-aware Grasp Framework with Normalized Grasp Space for Efficient 6-DoF Grasping

    Authors: Siang Chen, Pengwei Xie, Wei Tang, Dingchang Hu, Yixiang Dai, Guijin Wang

    Abstract: A series of region-based methods succeed in extracting regional features and enhancing grasp detection quality. However, faced with a cluttered scene with potential collision, the definition of the grasp-relevant region stays inconsistent, and the relationship between grasps and regional spaces remains incompletely investigated. In this paper, we propose Normalized Grasp Space (NGS) from a novel r… ▽ More

    Submitted 5 September, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted by CoRL2024, final camera-ready version will be updated soon

  45. arXiv:2406.01195  [pdf, other

    cs.RO

    C$^3$P-VoxelMap: Compact, Cumulative and Coalescible Probabilistic Voxel Mapping

    Authors: Xu Yang, Wenhao Li, Qijie Ge, Lulu Suo, Weijie Tang, Zhengyu Wei, Longxiang Huang, Bo Wang

    Abstract: This work presents a compact, cumulative and coalescible probabilistic voxel mapping method to enhance performance, accuracy and memory efficiency in LiDAR odometry. Probabilistic voxel mapping requires storing past point clouds and re-iterating on them to update the uncertainty every iteration, which consumes large memory space and CPU cycles. To solve this problem, we propose a two-folded strate… ▽ More

    Submitted 10 October, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  46. arXiv:2406.00429  [pdf, other

    cs.CV

    Towards Generalizable Multi-Object Tracking

    Authors: Zheng Qin, Le Wang, Sanping Zhou, Panpan Fu, Gang Hua, Wei Tang

    Abstract: Multi-Object Tracking MOT encompasses various tracking scenarios, each characterized by unique traits. Effective trackers should demonstrate a high degree of generalizability across diverse scenarios. However, existing trackers struggle to accommodate all aspects or necessitate hypothesis and experimentation to customize the association information motion and or appearance for a given scenario, le… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: CVPR2024

  47. arXiv:2405.20220  [pdf, other

    cs.DC cs.CY

    BeerReview: A Blockchain-enabled Peer Review Platform

    Authors: Guodong Jin, Zihan Zhou, Wenzheng Tang, Kanglei Yu, Hao Xu, Erwu Liu

    Abstract: In an era of increasing concerns over intellectual property rights, traditional peer review systems face challenges including plagiarism, malicious attacks, and unauthorized data access. BeerReview, a blockchain-enabled peer review platform, offers a robust solution, enabling experts and scholars to participate actively in the review process without concerns about plagiarism or security threats. F… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  48. Let Me Do It For You: Towards LLM Empowered Recommendation via Tool Learning

    Authors: Yuyue Zhao, Jiancan Wu, Xiang Wang, Wei Tang, Dingxian Wang, Maarten de Rijke

    Abstract: Conventional recommender systems (RSs) face challenges in precisely capturing users' fine-grained preferences. Large language models (LLMs) have shown capabilities in commonsense reasoning and leveraging external tools that may help address these challenges. However, existing LLM-based RSs suffer from hallucinations, misalignment between the semantic space of items and the behavior space of users,… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  49. arXiv:2405.14953  [pdf, other

    cs.LG cs.AI stat.ML

    MallowsPO: Fine-Tune Your LLM with Preference Dispersions

    Authors: Haoxian Chen, Hanyang Zhao, Henry Lam, David Yao, Wenpin Tang

    Abstract: Direct Preference Optimization (DPO) has recently emerged as a popular approach to improve reinforcement learning with human feedback (RLHF), leading to better techniques to fine-tune large language models (LLM). A weakness of DPO, however, lies in its lack of capability to characterize the diversity of human preferences. Inspired by Mallows' theory of preference ranking, we develop in this paper… ▽ More

    Submitted 2 October, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  50. arXiv:2405.07760  [pdf, other

    cs.LG stat.ML

    CAGES: Cost-Aware Gradient Entropy Search for Efficient Local Multi-Fidelity Bayesian Optimization

    Authors: Wei-Ting Tang, Joel A. Paulson

    Abstract: Bayesian optimization (BO) is a popular approach for optimizing expensive-to-evaluate black-box objective functions. An important challenge in BO is its application to high-dimensional search spaces due in large part to the curse of dimensionality. One way to overcome this challenge is to focus on local BO methods that aim to efficiently learn gradients, which have shown strong empirical performan… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.