
Showing 1–50 of 2,607 results for author: Tang, J

  1. arXiv:2410.13545  [pdf, ps, other]

    cs.CR cs.AR

    Three-Input Ciphertext Multiplication for Homomorphic Encryption

    Authors: Sajjad Akherati, Yok Jye Tang, Xinmiao Zhang

    Abstract: Homomorphic encryption (HE) allows computations to be directly carried out on ciphertexts and is essential to privacy-preserving computing, such as neural network inference, medical diagnosis, and financial data analysis. Only addition and 2-input multiplication are defined over ciphertexts in popular HE schemes. However, many HE applications involve non-linear functions and they need to be approx…

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 5 pages, 2 figures, 2 tables, conference paper

  2. arXiv:2410.13515  [pdf, other]

    hep-ex hep-lat hep-ph nucl-ex

    Observation of a rare beta decay of the charmed baryon with a Graph Neural Network

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (637 additional authors not shown)

    Abstract: The study of beta decay of the charmed baryon provides unique insights into the fundamental mechanism of the strong and electro-weak interactions. The $Λ_c^+$, being the lightest charmed baryon, undergoes disintegration solely through the charm quark weak decay. Its beta decay provides an ideal laboratory for investigating non-perturbative effects in quantum chromodynamics and for constraining the…

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 28 pages, 6 figures

  3. arXiv:2410.13478  [pdf, other]

    hep-ex

    Observation of $χ_{c0}\toΣ^{+}\barΣ^{-}η$ and evidence for $χ_{c1,2}\toΣ^{+}\barΣ^{-}η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (634 additional authors not shown)

    Abstract: Using $(27.12\pm 0.14)\times10^{8}$ $ψ(3686)$ events collected with the BESIII detector, the decay $χ_{c0}\toΣ^{+}\barΣ^{-}η$ is observed for the first time with a statistical significance of $7.0σ$, and evidence for $χ_{c1}\toΣ^{+}\barΣ^{-}η$ and $χ_{c2}\toΣ^{+}\barΣ^{-}η$ is found with statistical significances of $4.3σ$ and $4.6σ$, respectively. The branching fractions are determined to be…

    Submitted 17 October, 2024; originally announced October 2024.

  4. arXiv:2410.13368  [pdf, other]

    hep-ex hep-ph

    Observation of the Singly Cabibbo-Suppressed Decay $Λ_c^{+}\to pπ^0$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (638 additional authors not shown)

    Abstract: Utilizing 4.5 fb$^{-1}$ of $e^+e^-$ annihilation data collected with the BESIII detector at the BEPCII collider at center-of-mass energies between 4.600 and 4.699 GeV, the first observation of the singly Cabibbo-suppressed decay $Λ_c^{+}\to pπ^0$ is presented, with a statistical significance of $5.4σ$. The ratio of the branching fractions of $Λ_c^{+}\to pπ^0$ and $Λ_c^{+}\to pη$ is measured…

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 9 pages, 4 figures

  5. arXiv:2410.13088  [pdf, other]

    cs.LG cs.CL cs.MM

    Self-Comparison for Dataset-Level Membership Inference in Large (Vision-)Language Models

    Authors: Jie Ren, Kangrui Chen, Chen Chen, Vikash Sehwag, Yue Xing, Jiliang Tang, Lingjuan Lyu

    Abstract: Large Language Models (LLMs) and Vision-Language Models (VLMs) have made significant advancements in a wide range of natural language processing and vision-language tasks. Access to large web-scale datasets has been a key factor in their success. However, concerns have been raised about the unauthorized use of copyrighted materials and potential copyright infringement. Existing methods, such as sa…

    Submitted 16 October, 2024; originally announced October 2024.

  6. arXiv:2410.12620  [pdf, other]

    hep-ex

    Search for $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ at center-of-mass energies from 4.47 to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (644 additional authors not shown)

    Abstract: Utilizing a data set of $6.7$ fb$^{-1}$ from electron-positron collisions recorded by the BESIII detector at the BEPCII storage ring, a search is conducted for the processes $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ across center-of-mass energies from 4.47 to 4.95 GeV. In the absence of any significant signals, upper limits are set. These include limits on the Born cross sections for…

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 14 pages, 6 figures

  7. arXiv:2410.12352  [pdf, other]

    cs.CE

    Private Order Flows and Builder Bidding Dynamics: The Road to Monopoly in Ethereum's Block Building Market

    Authors: Shuzheng Wang, Yue Huang, Wenqin Zhang, Yuming Huang, Xuechao Wang, Jing Tang

    Abstract: Ethereum, as a representative of Web3, adopts a novel framework called Proposer Builder Separation (PBS) to prevent the centralization of block profits in the hands of institutional Ethereum stakers. Introducing builders to generate blocks based on public transactions, PBS aims to ensure that block profits are distributed among all stakers. Through the auction among builders, only one will win the…

    Submitted 16 October, 2024; originally announced October 2024.

  8. arXiv:2410.12349  [pdf, ps, other]

    eess.SP

    When atomic norm meets the G-filter: A general framework for line spectral estimation

    Authors: Bin Zhu, Jiale Tang

    Abstract: This paper proposes a novel approach for line spectral estimation which combines Georgiou's filter bank (G-filter) with atomic norm minimization (ANM). A key ingredient is a Carathéodory–Fejér-type decomposition for the covariance matrix of the filter output. The resulting optimization problem can be characterized via semidefinite programming and contains the standard ANM for line spectral estima…

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 5 pages, 3 figures. Submitted to the Satellite Workshop HiPeCASP of ICASSP 2025

  9. arXiv:2410.11867  [pdf, other]

    cs.HC cs.RO

    Neural Signal Operated Intelligent Robot: Human-guided Robot Maze Navigation through SSVEP

    Authors: Jiarui Tang, Tingrui Sun, Siwen Wang

    Abstract: Brain-computer Interface (BCI) applications based on steady-state visual evoked potentials (SSVEP) have the advantages of being fast, accurate and mobile. SSVEP is the EEG response evoked by visual stimuli that are presented at a specific frequency, which results in an increase in the EEG at that same frequency. In this paper, we proposed a novel human-guided maze solving robot navigation system b…

    Submitted 8 October, 2024; originally announced October 2024.

  10. arXiv:2410.11859  [pdf, other]

    cs.HC cs.CY

    SouLLMate: An Adaptive LLM-Driven System for Advanced Mental Health Support and Assessment, Based on a Systematic Application Survey

    Authors: Qiming Guo, Jinwen Tang, Wenbo Sun, Haoteng Tang, Yi Shang, Wenlu Wang

    Abstract: Mental health issues significantly impact individuals' daily lives, yet many do not receive the help they need even with available online resources. This study aims to provide accessible, stigma-free, personalized, and real-time mental health support through cutting-edge AI technologies. It makes the following contributions: (1) Conducting an extensive survey of recent mental health support method…

    Submitted 6 October, 2024; originally announced October 2024.

  11. arXiv:2410.11664  [pdf, ps, other]

    math-ph

    Mathematical Foundation of the U$^N(1)$ Quantum Geometric Tensor

    Authors: Xin Wang, Xu-Yang Hou, Jia-Chen Tang, Hao Guo

    Abstract: In this paper, we systematically establish the mathematical foundation for the $\text{U}^N(1)$ quantum geometric tensor (QGT) of mixed states. Explicitly, we present a description based on the $\text{U}^N(1)$ principal bundle and derive a Pythagorean-like distance decomposition equation. Additionally, we offer a comprehensive comparison of its properties with those of the U(1) principal bundle desc…

    Submitted 15 October, 2024; originally announced October 2024.

  12. arXiv:2410.11607  [pdf, other]

    hep-ex

    Observation of $χ_{cJ}\to p \bar p K^0_S K^- π^+ + c.c.$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, et al. (648 additional authors not shown)

    Abstract: By analyzing $(27.12\pm0.14)\times10^8$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decays of $χ_{cJ} \to p \bar{p} K^0_S K^- π^+ +c.c.(J=0, 1, 2)$ are observed for the first time with statistical significances greater than $10σ$. The branching fractions of these decays are determined to be…

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 12 pages, 5 figures

  13. arXiv:2410.11538  [pdf, other]

    cs.CV

    MCTBench: Multimodal Cognition towards Text-Rich Visual Scenes Benchmark

    Authors: Bin Shan, Xiang Fei, Wei Shi, An-Lan Wang, Guozhi Tang, Lei Liao, Jingqun Tang, Xiang Bai, Can Huang

    Abstract: The comprehension of text-rich visual scenes has become a focal point for evaluating Multi-modal Large Language Models (MLLMs) due to their widespread applications. Current benchmarks tailored to the scenario emphasize perceptual capabilities, while overlooking the assessment of cognitive abilities. To address this limitation, we introduce a Multimodal benchmark towards Text-rich visual scenes, to…

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 12 pages, 5 figures, project page: https://github.com/xfey/MCTBench?tab=readme-ov-file

  14. arXiv:2410.10819  [pdf, other]

    cs.CL

    DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

    Authors: Guangxuan Xiao, Jiaming Tang, Jingwei Zuo, Junxian Guo, Shang Yang, Haotian Tang, Yao Fu, Song Han

    Abstract: Deploying long-context large language models (LLMs) is essential but poses significant computational and memory challenges. Caching all Key and Value (KV) states across all attention heads consumes substantial memory. Existing KV cache pruning methods either damage the long-context capabilities of LLMs or offer only limited efficiency improvements. In this paper, we identify that only a fraction o…

    Submitted 14 October, 2024; originally announced October 2024.

  15. arXiv:2410.09431  [pdf, other]

    cs.RO

    REGNet V2: End-to-End REgion-based Grasp Detection Network for Grippers of Different Sizes in Point Clouds

    Authors: Binglei Zhao, Han Wang, Jian Tang, Chengzhong Ma, Hanbo Zhang, Jiayuan Zhang, Xuguang Lan, Xingyu Chen

    Abstract: Grasping has been a crucial but challenging problem in robotics for many years. One of the most important challenges is how to make grasping generalizable and robust to novel objects as well as grippers in unstructured environments. We present REGNet V2, a robotic grasping system that can adapt to different parallel jaws to grasp diversified objects. To support different grippers, REGNet V2 embeds the…

    Submitted 12 October, 2024; originally announced October 2024.

  16. arXiv:2410.09411  [pdf, other]

    cs.LG stat.ML

    Towards the Effect of Examples on In-Context Learning: A Theoretical Case Study

    Authors: Pengfei He, Yingqian Cui, Han Xu, Hui Liu, Makoto Yamada, Jiliang Tang, Yue Xing

    Abstract: In-context learning (ICL) has emerged as a powerful capability for large language models (LLMs) to adapt to downstream tasks by leveraging a few (demonstration) examples. Despite its effectiveness, the mechanism behind ICL remains underexplored. To better understand how ICL integrates the examples with the knowledge learned by the LLM during pre-training (i.e., pre-training knowledge) and how the…

    Submitted 12 October, 2024; originally announced October 2024.

  17. arXiv:2410.09088  [pdf, other]

    cs.CV cs.AI

    The Solution for Temporal Action Localisation Task of Perception Test Challenge 2024

    Authors: Yinan Han, Qingyuan Jiang, Hongming Mei, Yang Yang, Jinhui Tang

    Abstract: This report presents our method for Temporal Action Localisation (TAL), which focuses on identifying and classifying actions within specific time intervals throughout a video sequence. We employ a data augmentation technique by expanding the training dataset using overlapping labels from the Something-SomethingV2 dataset, enhancing the model's ability to generalize across various action classes. F…

    Submitted 7 October, 2024; originally announced October 2024.

  18. arXiv:2410.08933  [pdf, ps, other]

    math.CT math.RA

    Profinite and Solid Cohomology

    Authors: Jiacheng Tang

    Abstract: Solid abelian groups, as introduced by Dustin Clausen and Peter Scholze, form a subcategory of all condensed abelian groups satisfying some "completeness" conditions and having favourable categorical properties. Given a profinite ring $R$, there is an associated condensed ring $\underline{R}$ which is solid. We show that the natural embedding of profinite $R$-modules into solid $\underline{R}$-m…

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: 27 pages

    MSC Class: 18G15; 18B25; 16W80

  19. arXiv:2410.08879  [pdf, other]

    cs.CV

    Multi-modal Fusion based Q-distribution Prediction for Controlled Nuclear Fusion

    Authors: Shiao Wang, Yifeng Wang, Qingchuan Ma, Xiao Wang, Ning Yan, Qingquan Yang, Guosheng Xu, Jin Tang

    Abstract: Q-distribution prediction is a crucial research direction in controlled nuclear fusion, with deep learning emerging as a key approach to solving prediction challenges. In this paper, we leverage deep learning techniques to tackle the complexities of Q-distribution prediction. Specifically, we explore multimodal fusion methods in computer vision, integrating 2D line image data with the original 1D…

    Submitted 11 October, 2024; originally announced October 2024.

  20. arXiv:2410.08603  [pdf, other]

    hep-ex

    Observation of $D^+\toη^\primeμ^+ν_μ$ and First Study of $D^+\to η^\prime \ell^+ν_\ell$ Decay Dynamics

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (643 additional authors not shown)

    Abstract: Using $20.3\,\rm fb^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy 3.773 GeV with the BESIII detector, we report the first observation of the semileptonic decay $D^+\to η^\prime μ^+ν_μ$ with a significance of $8.6σ$ including systematic uncertainties, and an improved measurement of $D^+\to η^\prime e^+ν_e$. The branching fractions of $D^+\to η^\prime μ^+ν_μ$ and…

    Submitted 11 October, 2024; originally announced October 2024.

  21. arXiv:2410.08256  [pdf, other]

    cs.LG cs.AI cs.HC

    AdaShadow: Responsive Test-time Model Adaptation in Non-stationary Mobile Environments

    Authors: Cheng Fang, Sicong Liu, Zimu Zhou, Bin Guo, Jiaqi Tang, Ke Ma, Zhiwen Yu

    Abstract: On-device adapting to continual, unpredictable domain shifts is essential for mobile applications like autonomous driving and augmented reality to deliver seamless user experiences in evolving environments. Test-time adaptation (TTA) emerges as a promising solution by tuning model parameters with unlabeled live data immediately before prediction. However, TTA's unique forward-backward-reforward pi…

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: This paper is accepted by SenSys 2024. Copyright may be transferred without notice

    Journal ref: The 22nd ACM Conference on Embedded Networked Sensor Systems, 2024

  22. arXiv:2410.08092  [pdf, other]

    cs.CV cs.RO

    UW-SDF: Exploiting Hybrid Geometric Priors for Neural SDF Reconstruction from Underwater Multi-view Monocular Images

    Authors: Zeyu Chen, Jingyi Tang, Gu Wang, Shengquan Li, Xinghui Li, Xiangyang Ji, Xiu Li

    Abstract: Due to the unique characteristics of underwater environments, accurate 3D reconstruction of underwater objects poses a challenging problem in tasks such as underwater exploration and mapping. Traditional methods that rely on multiple sensor data for 3D reconstruction are time-consuming and face challenges in data acquisition in underwater scenarios. We propose UW-SDF, a framework for reconstructin…

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: 8 pages, 9 figures, presented at IROS 2024

  23. arXiv:2410.07854  [pdf, other]

    cs.CV cs.MM

    HeGraphAdapter: Tuning Multi-Modal Vision-Language Models with Heterogeneous Graph Adapter

    Authors: Yumiao Zhao, Bo Jiang, Xiao Wang, Qin Xu, Jin Tang

    Abstract: Adapter-based tuning methods have shown significant potential in transferring knowledge from pre-trained Vision-Language Models to the downstream tasks. However, after reviewing existing adapters, we find they generally fail to fully explore the interactions between different modalities in constructing task-specific knowledge. Also, existing works usually only focus on similarity matching between…

    Submitted 10 October, 2024; originally announced October 2024.

  24. arXiv:2410.07626  [pdf, other]

    hep-ex

    Precision Measurement of the Branching Fraction of $D^{+}\to μ^{+}ν_μ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (643 additional authors not shown)

    Abstract: Using $20.3~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of $E_{\rm cm}=3.773$ GeV with the BESIII detector operating at the BEPCII collider, we determine the branching fraction of the leptonic decay $D^+\toμ^+ν_μ$ to be $(3.981\pm0.079_{\rm stat}\pm0.040_{\rm syst})\times10^{-4}$. Interpreting our measurement with knowledge of the Fermi coupling constant…

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: 9 pages, 2 figures

  25. arXiv:2410.07296  [pdf, other]

    cs.CV

    ReinDiffuse: Crafting Physically Plausible Motions with Reinforced Diffusion Model

    Authors: Gaoge Han, Mingjiang Liang, Jinglei Tang, Yongkang Cheng, Wei Liu, Shaoli Huang

    Abstract: Generating human motion from textual descriptions is a challenging task. Existing methods either struggle with physical credibility or are limited by the complexities of physics simulations. In this paper, we present ReinDiffuse, which combines reinforcement learning with a motion diffusion model to generate physically credible human motions that align with textual descriptions. Our method adap…

    Submitted 15 October, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

    Comments: Accepted by WACV 2025 in Round 1

  26. arXiv:2410.06500  [pdf, other]

    hep-ex

    Search for the radiative decays $D^+\toγρ^+$ and $D^+\toγK^{*+}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (648 additional authors not shown)

    Abstract: We search for the radiative decays $D^{+} \to γρ^+$ and $D^{+} \to γK^{*+}$ using 20.3 fb$^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$ GeV by the BESIII detector operating at the BEPCII collider. No significant signals are observed, and the upper limits on the branching fractions of $D^{+} \to γρ^+$ and $D^{+} \to γK^{*+}$ at 90% confidence level ar…

    Submitted 8 October, 2024; originally announced October 2024.

  27. arXiv:2410.05736  [pdf, ps, other]

    hep-ex

    Observation of an axial-vector state in the study of $ψ(3686) \to φηη'$ decay

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (625 additional authors not shown)

    Abstract: Using (2712.4 $\pm$ 14.3)$\times 10^{6}$ $ψ(3686)$ events collected with the BESIII detector at BEPCII, a partial wave analysis of the decay $ψ(3686) \to φηη' $ is performed with the covariant tensor approach. An axial-vector state with a mass near 2.3 $\rm GeV/c^2$ is observed for the first time. Its mass and width are measured to be 2316…

    Submitted 8 October, 2024; originally announced October 2024.

  28. arXiv:2410.05635  [pdf]

    physics.gen-ph

    Novel inverse multi-objective optimization-empowered design of microperforated panels for enhanced low-frequency noise mitigation

    Authors: Duo Zhang, Yang Zhang, Sichen Yuan, Jiong Tang, Kai Zhou

    Abstract: Microperforated panels (MPPs) display excellent capacity in noise control applications owing to their high strength, simple design, and efficacy in low-frequency sound absorption. Traditionally, the development of MPPs has relied on a trial-and-error design approach. Although simple optimization-based methods have recently begun to be employed, these designs often overlook practical considerations…

    Submitted 7 October, 2024; originally announced October 2024.

    Comments: 32 pages, 11 figures

  29. arXiv:2410.05298  [pdf, ps, other]

    cs.LG cs.AI

    How Do Large Language Models Understand Graph Patterns? A Benchmark for Graph Pattern Comprehension

    Authors: Xinnan Dai, Haohao Qu, Yifen Shen, Bohang Zhang, Qihao Wen, Wenqi Fan, Dongsheng Li, Jiliang Tang, Caihua Shan

    Abstract: Benchmarking the capabilities and limitations of large language models (LLMs) in graph-related tasks is becoming an increasingly popular and crucial area of research. Recent studies have shown that LLMs exhibit a preliminary ability to understand graph structures and node features. However, the potential of LLMs in graph pattern mining remains largely unexplored. This is a key component in fields…

    Submitted 4 October, 2024; originally announced October 2024.

  30. arXiv:2410.05155  [pdf, other]

    cond-mat.mtrl-sci physics.optics

    Formation of Anisotropic Polarons in Antimony Selenide

    Authors: Yijie Shi, Xi Wang, Zhong Wang, Zheng Zhang, Fuyong Hua, Chao Chen, Chunlong Hu, Jiang Tang, Wenxi Liang

    Abstract: Antimony selenide (Sb$_2$Se$_3$) is an attractive candidate for photovoltaics, although its efficiency is not yet satisfactory. Besides defects, polaron formation originating from lattice distortion has been proposed to account for the trapping of free carriers and for the subsequent photoexcitation dynamics and optoelectronic properties, but such a mechanism still lacks structural observations. Here we directly track the p…

    Submitted 7 October, 2024; originally announced October 2024.

  31. arXiv:2410.05061  [pdf, other]

    eess.SP

    Bias-Variance Trade-off in Kalman Filter-Based Disturbance Observers

    Authors: Shilei Li, Dawei Shi, Xiaoxu Lyu, Jiawei Tang, Ling Shi

    Abstract: The performance of disturbance observers is strongly influenced by the level of prior knowledge about the disturbance model. The simultaneous input and state estimation (SISE) algorithm is widely recognized for providing unbiased minimum-variance estimates under arbitrary disturbance models. In contrast, the Kalman filter-based disturbance observer (KF-DOB) achieves minimum mean-square error estim…

    Submitted 7 October, 2024; originally announced October 2024.

  32. arXiv:2410.04360  [pdf, other]

    cs.MA cs.AI

    GenSim: A General Social Simulation Platform with Large Language Model based Agents

    Authors: Jiakai Tang, Heyang Gao, Xuchen Pan, Lei Wang, Haoran Tan, Dawei Gao, Yushuo Chen, Xu Chen, Yankai Lin, Yaliang Li, Bolin Ding, Jingren Zhou, Jun Wang, Ji-Rong Wen

    Abstract: With the rapid advancement of large language models (LLMs), recent years have witnessed many promising studies on leveraging LLM-based agents to simulate human social behavior. While prior work has demonstrated significant potential across various domains, much of it has focused on specific scenarios involving a limited number of agents and has lacked the ability to adapt when errors occur during…

    Submitted 9 October, 2024; v1 submitted 6 October, 2024; originally announced October 2024.

  33. arXiv:2410.03456  [pdf, other]

    cs.CV

    Dynamic Diffusion Transformer

    Authors: Wangbo Zhao, Yizeng Han, Jiasheng Tang, Kai Wang, Yibing Song, Gao Huang, Fan Wang, Yang You

    Abstract: Diffusion Transformer (DiT), an emerging diffusion model for image generation, has demonstrated superior performance but suffers from substantial computational costs. Our investigations reveal that these costs stem from the static inference paradigm, which inevitably introduces redundant computation in certain diffusion timesteps and spatial regions. To address this inefficiency, we propose Dynami…

    Submitted 8 October, 2024; v1 submitted 4 October, 2024; originally announced October 2024.

  34. arXiv:2410.02637  [pdf, other]

    cs.AI cs.CV

    Plots Unlock Time-Series Understanding in Multimodal Models

    Authors: Mayank Daswani, Mathias M. J. Bellaiche, Marc Wilson, Desislav Ivanov, Mikhail Papkov, Eva Schnider, Jing Tang, Kay Lamerigts, Gabriela Botea, Michael A. Sanchez, Yojan Patel, Shruthi Prabhakara, Shravya Shetty, Umesh Telang

    Abstract: While multimodal foundation models can now natively work with data beyond text, they remain underutilized in analyzing the considerable amounts of multi-dimensional time-series data in fields like healthcare, finance, and social sciences, representing a missed opportunity for richer, data-driven insights. This paper proposes a simple but effective method that leverages the existing vision encoders…

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: 49 pages

  35. arXiv:2410.02421  [pdf, other]

    hep-ex

    Search for lepton number violating decays of $D_s^+\to h^-h^0e^+e^+$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (650 additional authors not shown)

    Abstract: Based on 7.33 fb$^{-1}$ of $e^+e^-$ collision data collected by the BESIII detector operating at the BEPCII collider at center-of-mass energies from 4.128 to 4.226 GeV, a search for the Majorana neutrino $ν_m$ is conducted in the lepton-number-violating decays of $D_s^+\to h^-h^0e^+e^+$. Here, $h^-$ represents a $K^-$ or $π^-$, and $h^0$ represents a $π^0$, $K_S^0$ or $φ$. No significant signal is…

    Submitted 3 October, 2024; originally announced October 2024.

  36. arXiv:2410.02165  [pdf, other]

    cs.AI cs.CL

    A LLM-Powered Automatic Grading Framework with Human-Level Guidelines Optimization

    Authors: Yucheng Chu, Hang Li, Kaiqi Yang, Harry Shomer, Hui Liu, Yasemin Copur-Gencturk, Jiliang Tang

    Abstract: Open-ended short-answer questions (SAGs) have been widely recognized as a powerful tool for providing deeper insights into learners' responses in the context of learning analytics (LA). However, SAGs often present challenges in practice due to the high grading workload and concerns about inconsistent assessments. With recent advancements in natural language processing (NLP), automatic short-answer…

    Submitted 2 October, 2024; originally announced October 2024.

  37. arXiv:2410.00379  [pdf, other]

    cs.CV cs.AI cs.LG

    CXPMRG-Bench: Pre-training and Benchmarking for X-ray Medical Report Generation on CheXpert Plus Dataset

    Authors: Xiao Wang, Fuling Wang, Yuehang Li, Qingchuan Ma, Shiao Wang, Bo Jiang, Chuanfu Li, Jin Tang

    Abstract: X-ray image-based medical report generation (MRG) is a pivotal area in artificial intelligence which can significantly reduce diagnostic burdens and patient wait times. Despite significant progress, we believe that the task has reached a bottleneck due to the limited benchmark datasets and the existing large models' insufficient capability enhancements in this specialized domain. Specifically, the…

    Submitted 1 October, 2024; originally announced October 2024.

    Comments: In Peer Review

  38. arXiv:2409.19272  [pdf, other]

    cs.CL

    Perception Compressor: A training-free prompt compression method in long context scenarios

    Authors: Jiwei Tang, Jin Xu, Tingwei Lu, Hai Lin, Yiming Zhao, Hai-Tao Zheng

    Abstract: Large Language Models (LLMs) demonstrate exceptional capabilities in various scenarios. However, they suffer from much redundant information and tend to be lost in the middle in long context scenarios, leading to inferior performance. To address these challenges, we present Perception Compressor, a training-free prompt compression method. It includes a dual-slope ratio allocator to dynamically ass…

    Submitted 28 September, 2024; originally announced September 2024.

    Comments: 9 pages, 2 figures

  39. arXiv:2409.19223  [pdf, other]

    cs.CV eess.SP

    Summit Vitals: Multi-Camera and Multi-Signal Biosensing at High Altitudes

    Authors: Ke Liu, Jiankai Tang, Zhang Jiang, Yuntao Wang, Xiaojing Liu, Dong Li, Yuanchun Shi

    Abstract: Video photoplethysmography (vPPG) is an emerging method for non-invasive and convenient measurement of physiological signals, utilizing two primary approaches: remote video PPG (rPPG) and contact video PPG (cPPG). Monitoring vitals in high-altitude environments, where heart rates tend to increase and blood oxygen levels often decrease, presents significant challenges. To address these issues, we i…

    Submitted 27 September, 2024; originally announced September 2024.

    Comments: Accepted by UIC'24, 8 pages, 5 figures. Ke Liu and Jiankai Tang are co-first authors. Yuntao Wang and Xiaojing Liu are co-corresponding authors

  40. arXiv:2409.18707  [pdf, other]

    cs.RO

    Discrete Policy: Learning Disentangled Action Space for Multi-Task Robotic Manipulation

    Authors: Kun Wu, Yichen Zhu, Jinming Li, Junjie Wen, Ning Liu, Zhiyuan Xu, Qinru Qiu, Jian Tang

    Abstract: Learning visuomotor policy for multi-task robotic manipulation has been a long-standing challenge for the robotics community. The difficulty lies in the diversity of action space: typically, a goal can be accomplished in multiple ways, resulting in a multimodal action distribution for a single task. The complexity of action distribution escalates as the number of tasks increases. In this work, we…

    Submitted 27 September, 2024; originally announced September 2024.

  41. arXiv:2409.18114  [pdf, other]

    cs.CV

    EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation

    Authors: Jiaxiang Tang, Zhaoshuo Li, Zekun Hao, Xian Liu, Gang Zeng, Ming-Yu Liu, Qinsheng Zhang

    Abstract: Current auto-regressive mesh generation methods suffer from issues such as incompleteness, insufficient detail, and poor generalization. In this paper, we propose an Auto-regressive Auto-encoder (ArAE) model capable of generating high-quality 3D meshes with up to 4,000 faces at a spatial resolution of $512^3$. We introduce a novel mesh tokenization algorithm that efficiently compresses triangular…

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: Project Page: https://research.nvidia.com/labs/dir/edgerunner/

  42. arXiv:2409.17983  [pdf, other]

    astro-ph.HE

    GRB 240529A: A Tale of Two Shocks

    Authors: Tian-Rui Sun, Jin-Jun Geng, Jing-Zhi Yan, You-Dong Hu, Xue-Feng Wu, Alberto J. Castro-Tirado, Chao Yang, Yi-Ding Ping, Chen-Ran Hu, Fan Xu, Hao-Xuan Gao, Ji-An Jiang, Yan-Tian Zhu, Yongquan Xue, Ignacio Pérez-García, Si-Yu Wu, Emilio Fernández-García, María D. Caballero-García, Rubén Sánchez-Ramírez, Sergiy Guziy, Ignacio Olivares, Carlos Jesus Pérez del Pulgar, A. Castellón, Sebastián Castillo, Ding-Rong Xiong, et al. (44 additional authors not shown)

    Abstract: Thanks to the rapidly increasing time-domain facilities, we are entering a golden era of research on gamma-ray bursts (GRBs). In this Letter, we report our observations of GRB 240529A with the Burst Optical Observer and Transient Exploring System, the 1.5-meter telescope at Observatorio Sierra Nevada, the 2.5-meter Wide Field Survey Telescope of China, the Large Binocular Telescope, and the Telesc…

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: Resubmitted to ApJL after addressing the referee's comments; comments are welcome

  43. arXiv:2409.17440  [pdf, other]

    cs.AI

    A Time Series is Worth Five Experts: Heterogeneous Mixture of Experts for Traffic Flow Prediction

    Authors: Guangyu Wang, Yujie Chen, Ming Gao, Zhiqiao Wu, Jiafu Tang, Jiabi Zhao

    Abstract: Accurate traffic prediction faces significant challenges, necessitating a deep understanding of both temporal and spatial cues and their complex interactions across multiple variables. Recent advancements in traffic prediction systems are primarily due to the development of complex sequence-centric models. However, existing approaches often embed multiple variables and spatial relationships at eac…

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: 20 pages, 4 figures

  44. arXiv:2409.16600  [pdf, other]

    cs.CV

    FAFA: Frequency-Aware Flow-Aided Self-Supervision for Underwater Object Pose Estimation

    Authors: Jingyi Tang, Gu Wang, Zeyu Chen, Shengquan Li, Xiu Li, Xiangyang Ji

    Abstract: Although methods for estimating the pose of objects in indoor scenes have achieved great success, the pose estimation of underwater objects remains challenging due to difficulties brought by the complex underwater environment, such as degraded illumination, blurring, and the substantial cost of obtaining real annotations. In response, we introduce FAFA, a Frequency-Aware Flow-Aided self-supervised…

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: ECCV 2024

  45. arXiv:2409.15727  [pdf, other]

    cs.CV

    LaPose: Laplacian Mixture Shape Modeling for RGB-Based Category-Level Object Pose Estimation

    Authors: Ruida Zhang, Ziqin Huang, Gu Wang, Chenyangguang Zhang, Yan Di, Xingxing Zuo, Jiwen Tang, Xiangyang Ji

    Abstract: While RGBD-based methods for category-level object pose estimation hold promise, their reliance on depth data limits their applicability in diverse scenarios. In response, recent efforts have turned to RGB-based methods; however, they face significant challenges stemming from the absence of depth information. On one hand, the lack of depth exacerbates the difficulty in handling intra-class shape v…

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: Accepted by ECCV 2024

  46. arXiv:2409.15436  [pdf, other]

    cs.HC

    GenAI Advertising: Risks of Personalizing Ads with LLMs

    Authors: Brian Jay Tang, Kaiwen Sun, Noah T. Curran, Florian Schaub, Kang G. Shin

    Abstract: Recent advances in large language models have enabled the creation of highly effective chatbots, which may serve as a platform for targeted advertising. This paper investigates the risks of personalizing advertising in chatbots to their users. We developed a chatbot that embeds personalized product advertisements within LLM responses, inspired by similar forays by AI companies. Our benchmarks show…

    Submitted 23 September, 2024; originally announced September 2024.

  47. arXiv:2409.15044  [pdf, ps, other]

    hep-ex

    Search for $D^0\to K^-ηe^+ν_e$, $D^+\to K_S^0 ηe^+ν_e$ and $D^+\to ηηe^+ν_e$ decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (634 additional authors not shown)

    Abstract: By analyzing $e^+e^-$ annihilation data corresponding to an integrated luminosity of 7.93 fb$^{-1}$, collected at the center-of-mass energy of 3.773 GeV with the BESIII detector, we search for the semileptonic decays $D^0\to K^-ηe^+ν_e$, $D^+\to K_S^0 ηe^+ν_e$ and $D^+\to ηηe^+ν_e$ for the first time. We present evidence for $D^0\to K^-ηe^+ν_e$ with a significance of $3.3σ$. The branching fraction…

    Submitted 24 September, 2024; v1 submitted 23 September, 2024; originally announced September 2024.

    Comments: 10 pages, 4 figures

  48. arXiv:2409.14411  [pdf, other]

    cs.RO

    Scaling Diffusion Policy in Transformer to 1 Billion Parameters for Robotic Manipulation

    Authors: Minjie Zhu, Yichen Zhu, Jinming Li, Junjie Wen, Zhiyuan Xu, Ning Liu, Ran Cheng, Chaomin Shen, Yaxin Peng, Feifei Feng, Jian Tang

    Abstract: Diffusion Policy is a powerful tool for learning end-to-end visuomotor robot control. It is expected that Diffusion Policy possesses scalability, a key attribute for deep neural networks, typically suggesting that increasing model size would lead to enhanced performance. However, our observations indicate that Diffusion Policy in transformer architecture (\DP) struggles to scale effectiv…

    Submitted 22 September, 2024; originally announced September 2024.

  49. arXiv:2409.14088  [pdf, ps, other]

    cs.IT eess.SP

    Intelligent Reflecting Surface-Aided Multiuser Communication: Co-design of Transmit Diversity and Active/Passive Precoding

    Authors: Beixiong Zheng, Tiantian Ma, Jie Tang, Changsheng You, Shaoe Lin, Kai-Kit Wong

    Abstract: Intelligent reflecting surface (IRS) has become a cost-effective solution for constructing a smart and adaptive radio environment. Most previous works on IRS have jointly designed the active and passive precoding based on perfectly or partially known channel state information (CSI). However, in delay-sensitive or high-mobility communications, it is imperative to explore more effective methods for…

    Submitted 21 September, 2024; originally announced September 2024.

    Comments: 13 pages, 9 figures, Early Access in IEEE TWC

  50. arXiv:2409.13730  [pdf, other]

    cs.AI cs.CL

    VisScience: An Extensive Benchmark for Evaluating K12 Educational Multi-modal Scientific Reasoning

    Authors: Zhihuan Jiang, Zhen Yang, Jinhao Chen, Zhengxiao Du, Weihan Wang, Bin Xu, Yuxiao Dong, Jie Tang

    Abstract: Multi-modal large language models (MLLMs) have demonstrated promising capabilities across various tasks by integrating textual and visual information to achieve visual understanding in complex scenarios. Despite the availability of several benchmarks that aim to evaluate MLLMs in tasks from visual question answering to complex problem-solving, most focus predominantly on mathematics or general visua…

    Submitted 9 September, 2024; originally announced September 2024.

    Comments: 89 pages, 70 figures