Skip to main content

Showing 1–50 of 1,483 results for author: Luo, H

.
  1. arXiv:2410.13077  [pdf, other

    cs.CL cs.AI

    Tuning Language Models by Mixture-of-Depths Ensemble

    Authors: Haoyan Luo, Lucia Specia

    Abstract: Transformer-based Large Language Models (LLMs) traditionally rely on final-layer loss for training and final-layer representations for predictions, potentially overlooking the predictive power embedded in intermediate layers. Surprisingly, we find that focusing training efforts on these intermediate layers can yield training losses comparable to those of final layers, with complementary test-time… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  2. arXiv:2410.11370  [pdf, other

    cs.CL cs.IR

    Enhance Graph Alignment for Large Language Models

    Authors: Haitong Luo, Xuying Meng, Suhang Wang, Tianxiang Zhao, Fali Wang, Hanyun Cao, Yujun Zhang

    Abstract: Graph-structured data is prevalent in the real world. Recently, due to the powerful emergent capabilities, Large Language Models (LLMs) have shown promising performance in modeling graphs. The key to effectively applying LLMs on graphs is converting graph data into a format LLMs can comprehend. Graph-to-token approaches are popular in enabling LLMs to process graph information. They transform grap… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: Under review

  3. arXiv:2410.11363  [pdf, other

    cs.CV

    Visual-Geometric Collaborative Guidance for Affordance Learning

    Authors: Hongchen Luo, Wei Zhai, Jiao Wang, Yang Cao, Zheng-Jun Zha

    Abstract: Perceiving potential ``action possibilities'' (\ie, affordance) regions of images and learning interactive functionalities of objects from human demonstration is a challenging task due to the diversity of human-object interactions. Prevailing affordance learning algorithms often adopt the label assignment paradigm and presume that there is a unique relationship between functional region and afford… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  4. arXiv:2410.08949  [pdf, other

    cs.AI quant-ph

    Transferable Belief Model on Quantum Circuits

    Authors: Qianli Zhou, Hao Luo, Lipeng Pan, Yong Deng, Eloi Bosse

    Abstract: The transferable belief model, as a semantic interpretation of Dempster-Shafer theory, enables agents to perform reasoning and decision making in imprecise and incomplete environments. The model offers distinct semantics for handling unreliable testimonies, allowing for a more reasonable and general process of belief transfer compared to the Bayesian approach. However, because both the belief mass… ▽ More

    Submitted 17 October, 2024; v1 submitted 11 October, 2024; originally announced October 2024.

  5. arXiv:2410.06982  [pdf, other

    cs.CV

    Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation

    Authors: Runze Chen, Haiyong Luo, Fang Zhao, Jingze Yu, Yupeng Jia, Juan Wang, Xuepeng Ma

    Abstract: Monocular depth estimation, enabled by self-supervised learning, is a key technique for 3D perception in computer vision. However, it faces significant challenges in real-world scenarios, which encompass adverse weather variations, motion blur, as well as scenes with poor lighting conditions at night. Our research reveals that we can divide monocular depth estimation into three sub-problems: depth… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: To be published in Asian Conference on Computer Vision 2024

  6. arXiv:2410.06955  [pdf, other

    cond-mat.str-el

    Quantum dynamics in a spin-1/2 square lattice $J_{1}$-$J_{2}$-$δ$ altermagnet

    Authors: Yang Liu, Shiqi Shao, Saisai He, Z. Y. Xie, Jia-Wei Mei, Hong-Gang Luo, Jize Zhao

    Abstract: A key feature of the newly discovered altermagnet is that its spin degeneracy is lifted, although it has a antiferromagnetic order and zero net magnetization. In this work, we investigate a frustrated spin-1/2 $J_1$-$J_2$-$δ$ Heisenberg model on the square lattice by the tensor network method in combination with the linear spin-wave theory, with our focus on both the magnon excitations and longitu… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  7. arXiv:2410.05203  [pdf, other

    cs.CV cs.AI cs.LG

    Beyond FVD: Enhanced Evaluation Metrics for Video Generation Quality

    Authors: Ge Ya Luo, Gian Mario Favero, Zhi Hao Luo, Alexia Jolicoeur-Martineau, Christopher Pal

    Abstract: The Fréchet Video Distance (FVD) is a widely adopted metric for evaluating video generation distribution quality. However, its effectiveness relies on critical assumptions. Our analysis reveals three significant limitations: (1) the non-Gaussianity of the Inflated 3D Convnet (I3D) feature space; (2) the insensitivity of I3D features to temporal distortions; (3) the impractical sample sizes require… ▽ More

    Submitted 8 October, 2024; v1 submitted 7 October, 2024; originally announced October 2024.

  8. arXiv:2410.02623  [pdf, other

    math.ST math.NA stat.ML

    Ranking Perspective for Tree-based Methods with Applications to Symbolic Feature Selection

    Authors: Hengrui Luo, Meng Li

    Abstract: Tree-based methods are powerful nonparametric techniques in statistics and machine learning. However, their effectiveness, particularly in finite-sample settings, is not fully understood. Recent applications have revealed their surprising ability to distinguish transformations (which we call symbolic feature selection) that remain obscure under current theoretical understanding. This work provides… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: 39 pages, 6 figures

    MSC Class: 68W30; 62F07; 62G08

  9. arXiv:2410.01769  [pdf, other

    cs.CL

    Quantifying Generalization Complexity for Large Language Models

    Authors: Zhenting Qi, Hongyin Luo, Xuliang Huang, Zhuokai Zhao, Yibo Jiang, Xiangjun Fan, Himabindu Lakkaraju, James Glass

    Abstract: While large language models (LLMs) have shown exceptional capabilities in understanding complex queries and performing sophisticated tasks, their generalization abilities are often deeply entangled with memorization, necessitating more precise evaluation. To address this challenge, we introduce Scylla, a dynamic evaluation framework that quantitatively measures the generalization abilities of LLMs… ▽ More

    Submitted 3 October, 2024; v1 submitted 2 October, 2024; originally announced October 2024.

  10. arXiv:2410.00907  [pdf, other

    cs.CL

    Addition is All You Need for Energy-efficient Language Models

    Authors: Hongyin Luo, Wei Sun

    Abstract: Large neural networks spend most computation on floating point tensor multiplications. In this work, we find that a floating point multiplier can be approximated by one integer adder with high precision. We propose the linear-complexity multiplication L-Mul algorithm that approximates floating point number multiplication with integer addition operations. The new algorithm costs significantly less… ▽ More

    Submitted 2 October, 2024; v1 submitted 1 October, 2024; originally announced October 2024.

  11. arXiv:2409.20146  [pdf, other

    cs.CV

    VMAD: Visual-enhanced Multimodal Large Language Model for Zero-Shot Anomaly Detection

    Authors: Huilin Deng, Hongchen Luo, Wei Zhai, Yang Cao, Yu Kang

    Abstract: Zero-shot anomaly detection (ZSAD) recognizes and localizes anomalies in previously unseen objects by establishing feature mapping between textual prompts and inspection images, demonstrating excellent research value in flexible industrial manufacturing. However, existing ZSAD methods are limited by closed-world settings, struggling to unseen defects with predefined prompts. Recently, adapting Mul… ▽ More

    Submitted 30 September, 2024; originally announced September 2024.

  12. arXiv:2409.19650  [pdf, other

    cs.CV cs.AI

    Grounding 3D Scene Affordance From Egocentric Interactions

    Authors: Cuiyu Liu, Wei Zhai, Yuhang Yang, Hongchen Luo, Sen Liang, Yang Cao, Zheng-Jun Zha

    Abstract: Grounding 3D scene affordance aims to locate interactive regions in 3D environments, which is crucial for embodied agents to interact intelligently with their surroundings. Most existing approaches achieve this by mapping semantics to 3D instances based on static geometric structure and visual appearance. This passive strategy limits the agent's ability to actively perceive and engage with the env… ▽ More

    Submitted 29 September, 2024; originally announced September 2024.

  13. arXiv:2409.19646  [pdf, other

    physics.plasm-ph

    Spatial Fluctuation of the Electric Field within SF6 Streamer Channel in Highly Non-Uniform Fields: Phenomenon, Validation, and Mechanism

    Authors: Zihao Feng, Xinxin Wang, Xiaobing Zou, Haiyun Luo, Yangyang Fu

    Abstract: The electric field within the streamer channel is a critical parameter in the calculation model for the nonlinear breakdown voltage of SF6, motivating the research presented in this paper. By using a 2D fluid model, we investigate the microscopic characteristics of the SF6 streamer channel in highly non-uniform fields and uncover a previously unexplained coherent structure: the spatial fluctuation… ▽ More

    Submitted 29 September, 2024; originally announced September 2024.

  14. arXiv:2409.17892  [pdf, other

    cs.CL

    EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models

    Authors: Shaoxiong Ji, Zihao Li, Indraneil Paul, Jaakko Paavola, Peiqin Lin, Pinzhen Chen, Dayy�n O'Brien, Hengyu Luo, Hinrich Sch�tze, J�rg Tiedemann, Barry Haddow

    Abstract: In this work, we introduce EMMA-500, a large-scale multilingual language model continue-trained on texts across 546 languages designed for enhanced multilingual performance, focusing on improving language coverage for low-resource languages. To facilitate continual pre-training, we compile the MaLA corpus, a comprehensive multilingual dataset enriched with curated datasets across diverse domains.… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

  15. arXiv:2409.17740  [pdf, other

    cs.CV

    AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status

    Authors: Jinghao Zhang, Wen Qian, Hao Luo, Fan Wang, Feng Zhao

    Abstract: Diffusion models have made compelling progress on facilitating high-throughput daily production. Nevertheless, the appealing customized requirements are remain suffered from instance-level finetuning for authentic fidelity. Prior zero-shot customization works achieve the semantic consistence through the condensed injection of identity features, while addressing detailed low-level signatures throug… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: 13 pages, 12 figures

  16. arXiv:2409.16719  [pdf, other

    nlin.CD

    Multi-functional reservoir computing

    Authors: Yao Du, Haibo Luo, Jianmin Guo, Jinghua Xiao, Yizhen Yu, Xingang Wang

    Abstract: Whereas the power of reservoir computing (RC) in inferring chaotic systems has been well established in the literature, the studies are mostly restricted to mono-functional machines where the training and testing data are acquired from the same attractor. Here, using the strategies of attractor labeling and trajectory separation, we propose a new scheme of RC capable of learning multiple attractor… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: 14 pages, 7 figures

  17. arXiv:2409.12455  [pdf, other

    cs.RO

    MuxHand: A Cable-driven Dexterous Robotic Hand Using Time-division Multiplexing Motors

    Authors: Jianle Xu, Shoujie Li, Hong Luo, Houde Liu, Xueqian Wang, Wenbo Ding, Chongkun Xia

    Abstract: The robotic dexterous hand is responsible for both grasping and dexterous manipulation. The number of motors directly influences both the dexterity and the cost of such systems. In this paper, we present MuxHand, a robotic hand that employs a time-division multiplexing motor (TDMM) mechanism. This system allows 9 cables to be independently controlled by just 4 motors, significantly reducing cost w… ▽ More

    Submitted 19 September, 2024; originally announced September 2024.

    Comments: 7 pages

  18. arXiv:2409.11728  [pdf, ps, other

    cs.IR

    Active Reconfigurable Intelligent Surface Empowered Synthetic Aperture Radar Imaging

    Authors: Yifan Sun, Rang Liu, Zhiping Lu, Honghao Luo, Ming Li, Qian Liu

    Abstract: Synthetic Aperture Radar (SAR) utilizes the movement of the radar antenna over a specific area of interest to achieve higher spatial resolution imaging. In this paper, we aim to investigate the realization of SAR imaging for a stationary radar system with the assistance of active reconfigurable intelligent surface (ARIS) mounted on an unmanned aerial vehicle (UAV). As the UAV moves along the stati… ▽ More

    Submitted 18 September, 2024; originally announced September 2024.

  19. arXiv:2409.11022  [pdf, other

    cs.CL cs.AI

    GEIC: Universal and Multilingual Named Entity Recognition with Large Language Models

    Authors: Hanjun Luo, Yingbin Jin, Xuecheng Liu, Tong Shang, Ruizhe Chen, Zuozhu Liu

    Abstract: Large Language Models (LLMs) have supplanted traditional methods in numerous natural language processing tasks. Nonetheless, in Named Entity Recognition (NER), existing LLM-based methods underperform compared to baselines and require significantly more computational resources, limiting their application. In this paper, we introduce the task of generation-based extraction and in-context classificat… ▽ More

    Submitted 25 September, 2024; v1 submitted 17 September, 2024; originally announced September 2024.

  20. arXiv:2409.08562  [pdf, other

    cs.CV

    CSS: Overcoming Pose and Scene Challenges in Crowd-Sourced 3D Gaussian Splatting

    Authors: Runze Chen, Mingyu Xiao, Haiyong Luo, Fang Zhao, Fan Wu, Hao Xiong, Qi Liu, Meng Song

    Abstract: We introduce Crowd-Sourced Splatting (CSS), a novel 3D Gaussian Splatting (3DGS) pipeline designed to overcome the challenges of pose-free scene reconstruction using crowd-sourced imagery. The dream of reconstructing historically significant but inaccessible scenes from collections of photographs has long captivated researchers. However, traditional 3D techniques struggle with missing camera poses… ▽ More

    Submitted 13 September, 2024; originally announced September 2024.

  21. arXiv:2409.07026  [pdf, ps, other

    math.RT math.CT math.KT

    Wakamatsu tilting subcategories and weak support tau-tilting subcategories in recollement

    Authors: Yongduo Wang, Hongyang Luo, Jian He, Dejun Wu

    Abstract: In this article, we prove that if (A, B, C) is a recollement of abelian categories, then wakamatsu tilting (resp. weak support tau-tilting) subcategories in A and C can induce wakamatsu tilting (resp. weak support tau-tilting) subcategories in B, and the converses hold under natural assumptions. As an application, we mainly consider the relationship of tau-cotorsion torsion triples in (A, B, C).

    Submitted 11 September, 2024; originally announced September 2024.

    MSC Class: 18G80; 18E10; 18E40

  22. arXiv:2409.04751  [pdf, other

    cs.CV cs.GR

    Fisheye-GS: Lightweight and Extensible Gaussian Splatting Module for Fisheye Cameras

    Authors: Zimu Liao, Siyan Chen, Rong Fu, Yi Wang, Zhongling Su, Hao Luo, Li Ma, Linning Xu, Bo Dai, Hengjie Li, Zhilin Pei, Xingcheng Zhang

    Abstract: Recently, 3D Gaussian Splatting (3DGS) has garnered attention for its high fidelity and real-time rendering. However, adapting 3DGS to different camera models, particularly fisheye lenses, poses challenges due to the unique 3D to 2D projection calculation. Additionally, there are inefficiencies in the tile-based splatting, especially for the extreme curvature and wide field of view of fisheye lens… ▽ More

    Submitted 11 September, 2024; v1 submitted 7 September, 2024; originally announced September 2024.

  23. arXiv:2409.04363  [pdf, other

    cs.CV

    RCNet: Deep Recurrent Collaborative Network for Multi-View Low-Light Image Enhancement

    Authors: Hao Luo, Baoliang Chen, Lingyu Zhu, Peilin Chen, Shiqi Wang

    Abstract: Scene observation from multiple perspectives would bring a more comprehensive visual experience. However, in the context of acquiring multiple views in the dark, the highly correlated views are seriously alienated, making it challenging to improve scene understanding with auxiliary views. Recent single image-based enhancement methods may not be able to provide consistently desirable restoration pe… ▽ More

    Submitted 6 September, 2024; originally announced September 2024.

    Comments: 14 Pages, 10 Figures, Under Review

  24. arXiv:2409.00606  [pdf, other

    cs.CV

    Style Transfer: From Stitching to Neural Networks

    Authors: Xinhe Xu, Zhuoer Wang, Yihan Zhang, Yizhou Liu, Zhaoyue Wang, Zhihao Xu, Muhan Zhao, Huaiying Luo

    Abstract: This article compares two style transfer methods in image processing: the traditional method, which synthesizes new images by stitching together small patches from existing images, and a modern machine learning-based approach that uses a segmentation network to isolate foreground objects and apply style transfer solely to the background. The traditional method excels in creating artistic abstracti… ▽ More

    Submitted 15 September, 2024; v1 submitted 1 September, 2024; originally announced September 2024.

  25. arXiv:2408.17162  [pdf, other

    cs.LG cs.AI

    Deep Feature Embedding for Tabular Data

    Authors: Yuqian Wu, Hengyi Luo, Raymond S. T. Lee

    Abstract: Tabular data learning has extensive applications in deep learning but its existing embedding techniques are limited in numerical and categorical features such as the inability to capture complex relationships and engineering. This paper proposes a novel deep embedding framework with leverages lightweight deep neural networks to generate effective feature embeddings for tabular data in machine lear… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: 15 pages, 2figures, accepted to ICONIP 2024, Paper ID: 1399

  26. arXiv:2408.13994  [pdf, other

    math.CO

    Tur�n number of complete bipartite graphs with bounded matching number

    Authors: Huan Luo, Xiamiao Zhao, Mei Lu

    Abstract: Let $\mathscr{F}$ be a family of graphs. A graph $G$ is $\mathscr{F}$-free if $G$ does not contain any $F\in \mathcal{F}$ as a subgraph. The Tur�n number $ex(n, \mathscr{F})$ is the maximum number of edges in an $n$-vertex $\mathscr{F}$-free graph. Let $M_{s}$ be the matching consisting of $ s $ independent edges. Recently, Alon and Frank determined the exact value of $ex(n,\{K_{m},M_{s+1}\})$. Ge… ▽ More

    Submitted 25 August, 2024; originally announced August 2024.

    Comments: 15 pages, 2 figures

  27. arXiv:2408.13547  [pdf, other

    math.NA math.OC math.ST

    Frontal Slice Approaches for Tensor Linear Systems

    Authors: Hengrui Luo, Anna Ma

    Abstract: Inspired by the row and column action methods for solving large-scale linear systems, in this work, we explore the use of frontal slices for solving tensor linear systems. In particular, this paper presents a novel approach for using frontal slices of a tensor $\mathcal{A}$ to solve tensor linear systems $\mathcal{A} * \mathcal{X} = \mathcal{B}$ where $*$ denotes the t-product. In addition, we con… ▽ More

    Submitted 24 August, 2024; originally announced August 2024.

    Comments: 41 pages, 10 figures

    MSC Class: 15A69; 15A72; 65F10

  28. arXiv:2408.13085  [pdf, other

    cs.CV cs.AI

    Map-Free Visual Relocalization Enhanced by Instance Knowledge and Depth Knowledge

    Authors: Mingyu Xiao, Runze Chen, Haiyong Luo, Fang Zhao, Juan Wang, Xuepeng Ma

    Abstract: Map-free relocalization technology is crucial for applications in autonomous navigation and augmented reality, but relying on pre-built maps is often impractical. It faces significant challenges due to limitations in matching methods and the inherent lack of scale in monocular images. These issues lead to substantial rotational and metric errors and even localization failures in real-world scenari… ▽ More

    Submitted 18 September, 2024; v1 submitted 23 August, 2024; originally announced August 2024.

    Comments: 17 pages,6 figures

  29. arXiv:2408.12760  [pdf, other

    eess.IV cs.CV

    Hierarchical Attention and Parallel Filter Fusion Network for Multi-Source Data Classification

    Authors: Han Luo, Feng Gao, Junyu Dong, Lin Qi

    Abstract: Hyperspectral image (HSI) and synthetic aperture radar (SAR) data joint classification is a crucial and yet challenging task in the field of remote sensing image interpretation. However, feature modeling in existing methods is deficient to exploit the abundant global, spectral, and local features simultaneously, leading to sub-optimal classification performance. To solve the problem, we propose a… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: Accepted by IEEE GRSL

  30. arXiv:2408.12164  [pdf

    cond-mat.mtrl-sci

    Excellent and CO$_2$$_{0.85}$Nd$_{0.1}$Cu$_{0.05}$O$_{2-δ}$-Nd$_x$Sr$_{1-x}$Fe$_{1-y}$Cu$_y$O$_{3-δ}$ dual-phase oxygen transport membranes

    Authors: Chao Zhang, Yue Zhu, Xiaopeng Wang, Yanhao Huang, Lingyong Zeng, Kuan Li, Peifeng Yu, Kangwang Wang, Longfu Li, Zaichen Xiang, Rui Chen, Xuefeng Zhu, Huixia Luo

    Abstract: Oxygen transport membranes(OTMs)have provided great opportunities in the last decades but are suffering from the trade-off effect between stability and oxygen permeability. Here, we report a group of new planar dual-phase mixed ionic-electronic conducting (MIEC) OTMs consisting of CO$_2$$_{0.85}$Nd$_{0.1}$Cu$_{0.05}$O$_2$ (CNCO) and Nd$_x$Sr$_{1-x}$Fe$_{1-y}$Cu$_y$O$_3$(NSFCO; $x = 0.4, 0.6$;… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: 36 pages, 6 figures

    Journal ref: Journal of Membrane Science, 2024,696,122485

  31. arXiv:2408.12160  [pdf

    cond-mat.mtrl-sci cond-mat.supr-con

    Mapping Hydrogen Evolution Activity Trends of V-based A15 Superconducting Alloys

    Authors: Peifeng Yu, Jie Zhan, Xiaobing Zhang, Kangwang Wang, Lingyong Zeng, Kuan Li, Chao Zhang, Longfu Li, Ying Liang, Kai Yan, Yan Sun, Huixia Luo

    Abstract: Exploring high-efficiency and low-cost electrocatalysts is valuable for water-splitting technologies. Recently, Si-group compounds have attracted increasing attention in electrocatalysis, considering the abundant Si-group elements on Earth. However, Si-group compounds for HER electrocatalysis have not been systematically studied. In this study, we unveil the activity trends of non-noble metal cata… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: 25 pages, 5 figures

    Journal ref: Chemical Engineering Journal,2024, 488, 150961

  32. arXiv:2408.11387  [pdf

    cond-mat.supr-con

    Structural and Superconducting Properties in the Te-doped Spinel CuRh2Se4

    Authors: Kuan Li, Lingyong Zeng, Longfu Li, Rui Chen, Peifeng Yu, Kangwang Wang, Chao Zhang, Zaichen Xiang, Huixia Luo

    Abstract: In this paper, we discuss the impact of tellurium (Te) doping on the spinel superconductor CuRh2Se4. We conducted a comprehensive evaluation of the structural and superconducting properties of the system using various techniques, including X-ray diffraction (XRD), resistivity, magnetization, and specific heat measurements. Based on our XRD analysis, we found that the spinel superconductor CuRh2Se4… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 25 pages, 6 figures, 1 table

    Journal ref: Journal of Alloys and Compounds, 2024, 995, 174756

  33. arXiv:2408.11373  [pdf

    cond-mat.mtrl-sci cond-mat.other

    Revealing the nontrivial topological surface states of catalysts for effective photochemical carbon dioxide conversion

    Authors: Kangwang Wang, Longfu Li, Peifeng Yu, Nannan Tang, Lingyong Zeng, Kuan Li, Chao Zhang, Rui Chen, Zaichen Xiang, Huichao Wang, Yongqing Cai, Kai Yan, Huixia Luo

    Abstract: Topological semimetals with protected surface states mark a new paradigm of research beyond the early landmarks of band-structure engineering, allowing fabrication of efficient catalyst to harness the rich metallic surface states to activate specific chemical processes. Herein, we demonstrate a facile solid-phase method for in-situ doping of Ir at the Os sites in the Os3Sn7, an alloy with topologi… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 33 Pages, 6 Figures, 1 Table

    Journal ref: Applied Catalysis B: Environment and Energy,2024,358,124428

  34. arXiv:2408.11369  [pdf

    cond-mat.mtrl-sci cond-mat.other

    Non-trivial Topological Surface States Regulation of 1T-OsCoTe$_2$ Enables Selective C-C Coupling for Highly Efficient Photochemical CO$_2$ Reduction Toward C$_{2+}$ hydrocarbons

    Authors: Kangwang Wang, Mingjie Wu, Peifeng Yu, Hector F. Garces, Ying Liang, Longfu Li, Lingyong Zeng, Kuan Li, Chao Zhang, Kai Yan, Huixia Luo

    Abstract: Despite ongoing research, the rational design of nontrivial topological semimetal surface states for the selective photocatalytic CO$_2$ conversion into valuable products remains full of challenges. Herein, we present the synthesis of 1T-OsCoTe$_2$ for the photoreduction upgrading of CO$_2$ to tricarbon alkane C$_3$H$_8$,by the integration of experimental work and theory calculation. Experimental… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 31 pages, 6 Figures

    Journal ref: Applied Catalysis B: Environment and Energy,2024,352,124058

  35. arXiv:2408.09506  [pdf, other

    cs.DB

    The Story Behind the Lines: Line Charts as a Gateway to Dataset Discovery

    Authors: Daomin Ji, Hui Luo, Zhifeng Bao, J. Shane Culpepper

    Abstract: Line charts are a valuable tool for data analysis and exploration, distilling essential insights from a dataset. However, access to the underlying dataset behind a line chart is rarely readily available. In this paper, we explore a novel dataset discovery problem, dataset discovery via line charts, focusing on the use of line charts as queries to discover datasets within a large data repository th… ▽ More

    Submitted 18 August, 2024; originally announced August 2024.

  36. arXiv:2408.05913  [pdf, other

    cond-mat.str-el cond-mat.supr-con

    Doping Dependence of Spin-Momentum Locking in Bismuth-Based High-Temperature Cuprate Superconductors

    Authors: Hailan Luo, Kayla Currier, Chiu-Yun Lin, Kenneth Gotlieb, Ryo Mori, Hiroshi Eisaki, Alexei Fedorov, Zahid Hussain, Alessandra Lanzara

    Abstract: Non-zero spin orbit coupling has been reported in several unconventional superconductors due to the absence of inversion symmetry breaking. This contrasts with cuprate superconductors, where such interaction has been neglected for a long time. The recent report of a non-trivial spin orbit coupling in overdoped Bi2212 cuprate superconductor, has re-opened an old debate on both the source and role o… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: 6 figures

    Journal ref: Commun Mater 5, 140 (2024)

  37. arXiv:2408.05776  [pdf

    cs.NI eess.SP

    Convergence of Symbiotic Communications and Blockchain for Sustainable and Trustworthy 6G Wireless Networks

    Authors: Haoxiang Luo, Gang Sun, Cheng Chi, Hongfang Yu, Mohsen Guizani

    Abstract: Symbiotic communication (SC) is known as a new wireless communication paradigm, similar to the natural ecosystem population, and can enable multiple communication systems to cooperate and mutualize through service exchange and resource sharing. As a result, SC is seen as an important potential technology for future sixth-generation (6G) communications, solving the problem of lack of spectrum resou… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

  38. arXiv:2408.04170  [pdf

    cs.CV

    M2EF-NNs: Multimodal Multi-instance Evidence Fusion Neural Networks for Cancer Survival Prediction

    Authors: Hui Luo, Jiashuang Huang, Hengrong Ju, Tianyi Zhou, Weiping Ding

    Abstract: Accurate cancer survival prediction is crucial for assisting clinical doctors in formulating treatment plans. Multimodal data, including histopathological images and genomic data, offer complementary and comprehensive information that can greatly enhance the accuracy of this task. However, the current methods, despite yielding promising results, suffer from two notable limitations: they do not eff… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

  39. arXiv:2408.01926  [pdf, other

    cs.LG stat.ME stat.ML

    Efficient Decision Trees for Tensor Regressions

    Authors: Hengrui Luo, Akira Horiguchi, Li Ma

    Abstract: We proposed the tensor-input tree (TT) method for scalar-on-tensor and tensor-on-tensor regression problems. We first address scalar-on-tensor problem by proposing scalar-output regression tree models whose input variable are tensors (i.e., multi-way arrays). We devised and implemented fast randomized and deterministic algorithms for efficient fitting of scalar-on-tensor trees, making TT competiti… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.

    Comments: 36 pages, 9 Figures

    MSC Class: 62G08; 15A69 ACM Class: G.3

  40. arXiv:2407.21510  [pdf, other

    cs.CV

    PEAR: Phrase-Based Hand-Object Interaction Anticipation

    Authors: Zichen Zhang, Hongchen Luo, Wei Zhai, Yang Cao, Yu Kang

    Abstract: First-person hand-object interaction anticipation aims to predict the interaction process over a forthcoming period based on current scenes and prompts. This capability is crucial for embodied intelligence and human-robot collaboration. The complete interaction process involves both pre-contact interaction intention (i.e., hand motion trends and interaction hotspots) and post-contact interaction m… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: 22 pages, 10 figures, 4 tables

  41. arXiv:2407.21348  [pdf, other

    cs.RO

    SuperVINS: A visual-inertial SLAM framework integrated deep learning features

    Authors: Hongkun Luo, Chi Guo, Yang Liu, Zengke Li

    Abstract: In this article, we propose enhancements to VINS-Fusion by incorporating deep learning features and deep learning matching methods. We implemented the training of deep learning feature bag of words and utilized these features for loop closure detection. Additionally, we introduce the RANSAC algorithm in the deep learning feature matching module to optimize matching. SuperVINS, an improved version… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

  42. arXiv:2407.21043  [pdf, other

    cs.CL cs.AI cs.LG

    CP-Prompt: Composition-Based Cross-modal Prompting for Domain-Incremental Continual Learning

    Authors: Yu Feng, Zhen Tian, Yifan Zhu, Zongfu Han, Haoran Luo, Guangwei Zhang, Meina Song

    Abstract: The key challenge of cross-modal domain-incremental learning (DIL) is to enable the learning model to continuously learn from novel data with different feature distributions under the same task without forgetting old ones. However, existing top-performing methods still cause high forgetting rates, by lacking intra-domain knowledge extraction and inter-domain common prompting strategy. In this pape… ▽ More

    Submitted 2 August, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

    Comments: Accepted by ACM MM 2024

  43. arXiv:2407.20195  [pdf, ps, other

    math.OC math.NA

    Accelerated Primal-Dual Proximal Gradient Splitting Methods for Convex-Concave Saddle-Point Problems

    Authors: Hao Luo

    Abstract: In this paper, based a novel primal-dual dynamical model with adaptive scaling parameters and Bregman divergences, we propose new accelerated primal-dual proximal gradient splitting methods for solving bilinear saddle-point problems with provable optimal nonergodic convergence rates. For the first, using the spectral analysis, we show that a naive extension of acceleration model for unconstrained… ▽ More

    Submitted 3 September, 2024; v1 submitted 29 July, 2024; originally announced July 2024.

  44. arXiv:2407.19524  [pdf, other

    cs.CV cs.AI

    VersusDebias: Universal Zero-Shot Debiasing for Text-to-Image Models via SLM-Based Prompt Engineering and Generative Adversary

    Authors: Hanjun Luo, Ziye Deng, Haoyu Huang, Xuecheng Liu, Ruizhe Chen, Zuozhu Liu

    Abstract: With the rapid development of Text-to-Image (T2I) models, biases in human image generation against demographic social groups become a significant concern, impacting fairness and ethical standards in AI. Some researchers propose their methods to tackle with the issue. However, existing methods are designed for specific models with fixed prompts, limiting their adaptability to the fast-evolving mode… ▽ More

    Submitted 16 August, 2024; v1 submitted 28 July, 2024; originally announced July 2024.

  45. Power-LLaVA: Large Language and Vision Assistant for Power Transmission Line Inspection

    Authors: Jiahao Wang, Mingxuan Li, Haichen Luo, Jinguo Zhu, Aijun Yang, Mingzhe Rong, Xiaohua Wang

    Abstract: The inspection of power transmission line has achieved notable achievements in the past few years, primarily due to the integration of deep learning technology. However, current inspection approaches continue to encounter difficulties in generalization and intelligence, which restricts their further applicability. In this paper, we introduce Power-LLaVA, the first large language and vision assista… ▽ More

    Submitted 27 July, 2024; originally announced July 2024.

  46. arXiv:2407.15723  [pdf, other

    cs.CL cs.AI

    DStruct2Design: Data and Benchmarks for Data Structure Driven Generative Floor Plan Design

    Authors: Zhi Hao Luo, Luis Lara, Ge Ya Luo, Florian Golemo, Christopher Beckham, Christopher Pal

    Abstract: Text conditioned generative models for images have yielded impressive results. Text conditioned floorplan generation as a special type of raster image generation task also received particular attention. However there are many use cases in floorpla generation where numerical properties of the generated result are more important than the aesthetics. For instance, one might want to specify sizes for… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  47. arXiv:2407.15240  [pdf, other

    cs.CV

    BIGbench: A Unified Benchmark for Social Bias in Text-to-Image Generative Models Based on Multi-modal LLM

    Authors: Hanjun Luo, Haoyu Huang, Ziye Deng, Xuecheng Liu, Ruizhe Chen, Zuozhu Liu

    Abstract: Text-to-Image (T2I) generative models are becoming increasingly crucial due to their ability to generate high-quality images, which also raises concerns about the social biases in their outputs, especially in the human generation. Sociological research has established systematic classifications of bias. However, existing bias research about T2I models conflates different types of bias, impeding me… ▽ More

    Submitted 16 August, 2024; v1 submitted 21 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2405.17814

  48. arXiv:2407.13067  [pdf, other

    cs.HC cs.AI cs.CY

    Large Language Model Agents for Improving Engagement with Behavior Change Interventions: Application to Digital Mindfulness

    Authors: Harsh Kumar, Suhyeon Yoo, Angela Zavaleta Bernuy, Jiakai Shi, Huayin Luo, Joseph Williams, Anastasia Kuzminykh, Ashton Anderson, Rachel Kornfield

    Abstract: Although engagement in self-directed wellness exercises typically declines over time, integrating social support such as coaching can sustain it. However, traditional forms of support are often inaccessible due to the high costs and complex coordination. Large Language Models (LLMs) show promise in providing human-like dialogues that could emulate social support. Yet, in-depth, in situ investigati… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Under review

  49. arXiv:2407.12319  [pdf, other

    cs.CV

    Serialized Point Mamba: A Serialized Point Cloud Mamba Segmentation Model

    Authors: Tao Wang, Wei Wen, Jingzhi Zhai, Kang Xu, Haoming Luo

    Abstract: Point cloud segmentation is crucial for robotic visual perception and environmental understanding, enabling applications such as robotic navigation and 3D reconstruction. However, handling the sparse and unordered nature of point cloud data presents challenges for efficient and accurate segmentation. Inspired by the Mamba model's success in natural language processing, we propose the Serialized Po… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

  50. arXiv:2407.11227  [pdf, other

    cond-mat.str-el cond-mat.dis-nn

    Magnetic skin effect in Pb(Fe$_{1/2}$Nb$_{1/2}$)O$_3$

    Authors: N. Giles-Donovan, A. D. Hillier, K. Ishida, B. V. Hampshire, S. R. Giblin, B. Roessli, P. M. Gehring, G. Xu, X. Li, H. Luo, S. Cochran, C. Stock

    Abstract: Relaxor-ferroelectrics display exceptional dielectric properties resulting from the underlying random dipolar fields induced by strong chemical inhomogeneity. An unusual structural aspect of relaxors is a skin-effect where the near-surface region in single crystals exhibit structures and critical phenomena that differ from the bulk. Relaxors are unique in that this skin effect extends over a macro… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 32 pages, 13 figures

    Journal ref: J. Phys.: Condens. Matter 36 435802 (2024)