Skip to main content

Showing 1–50 of 128 results for author: Sun, X

Searching in archive eess. Search in all archives.
.
  1. arXiv:2409.09469  [pdf, other

    stat.ML cs.LG eess.SP q-bio.QM

    Hyperedge Representations with Hypergraph Wavelets: Applications to Spatial Transcriptomics

    Authors: Xingzhi Sun, Charles Xu, Jo�o F. Rocha, Chen Liu, Benjamin Hollander-Bodie, Laney Goldman, Marcello DiStasio, Michael Perlmutter, Smita Krishnaswamy

    Abstract: In many data-driven applications, higher-order relationships among multiple objects are essential in capturing complex interactions. Hypergraphs, which generalize graphs by allowing edges to connect any number of nodes, provide a flexible and powerful framework for modeling such higher-order relationships. In this work, we introduce hypergraph diffusion wavelets and describe their favorable spectr… ▽ More

    Submitted 14 September, 2024; originally announced September 2024.

  2. arXiv:2409.05289  [pdf, other

    cs.RO eess.SY

    Developing Trajectory Planning with Behavioral Cloning and Proximal Policy Optimization for Path-Tracking and Static Obstacle Nudging

    Authors: Mingyan Zhou, Biao Wang, Xiatao Sun

    Abstract: End-to-end approaches with Reinforcement Learning (RL) and Imitation Learning (IL) have gained increasing popularity in autonomous driving. However, they do not involve explicit reasoning like classic robotics workflow, nor planning with horizons, leading strategies implicit and myopic. In this paper, we introduce our trajectory planning method that uses Behavioral Cloning (BC) for path-tracking a… ▽ More

    Submitted 8 September, 2024; originally announced September 2024.

    Comments: 6 pages, 7 figures

  3. arXiv:2409.00356  [pdf, other

    cs.SD cs.AI eess.AS

    Contrastive Augmentation: An Unsupervised Learning Approach for Keyword Spotting in Speech Technology

    Authors: Weinan Dai, Yifeng Jiang, Yuanjing Liu, Jinkun Chen, Xin Sun, Jinglei Tao

    Abstract: This paper addresses the persistent challenge in Keyword Spotting (KWS), a fundamental component in speech technology, regarding the acquisition of substantial labeled data for training. Given the difficulty in obtaining large quantities of positive samples and the laborious process of collecting new target samples when the keyword changes, we introduce a novel approach combining unsupervised cont… ▽ More

    Submitted 31 August, 2024; originally announced September 2024.

    Comments: This paper has been accepted by the ICPR2024

  4. arXiv:2408.13733  [pdf, other

    eess.IV cs.CV

    Anatomical Consistency Distillation and Inconsistency Synthesis for Brain Tumor Segmentation with Missing Modalities

    Authors: Zheyu Zhang, Xinzhao Liu, Zheng Chen, Yueyi Zhang, Huanjing Yue, Yunwei Ou, Xiaoyan Sun

    Abstract: Multi-modal Magnetic Resonance Imaging (MRI) is imperative for accurate brain tumor segmentation, offering indispensable complementary information. Nonetheless, the absence of modalities poses significant challenges in achieving precise segmentation. Recognizing the shared anatomical structures between mono-modal and multi-modal representations, it is noteworthy that mono-modal images typically ex… ▽ More

    Submitted 25 August, 2024; originally announced August 2024.

    Comments: Accepted Paper to European Conference on Artificial Intelligence (ECAI 2024)

  5. arXiv:2408.10378  [pdf, other

    math.OC eess.SY

    Finite-time input-to-state stability for infinite-dimensional systems

    Authors: Xiaorong Sun, Jun Zheng, Guchuan Zhu

    Abstract: In this paper, we extend the notion of finite-time input-to-state stability (FTISS) for finite-dimensional systems to infinite-dimensional systems. More specifically, we first prove an FTISS Lyapunov theorem for a class of infinite-dimensional systems, namely, the existence of an FTISS Lyapunov functional (FTISS-LF) implies the FTISS of the system, and then, provide a sufficient condition for ensu… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  6. arXiv:2408.08669  [pdf, other

    cs.SD eess.AS

    HSDreport: Heart Sound Diagnosis with Echocardiography Reports

    Authors: Zihan Zhao, Pingjie Wang, Liudan Zhao, Yuchen Yang, Ya Zhang, Kun Sun, Xin Sun, Xin Zhou, Yu Wang, Yanfeng Wang

    Abstract: Heart sound auscultation holds significant importance in the diagnosis of congenital heart disease. However, existing methods for Heart Sound Diagnosis (HSD) tasks are predominantly limited to a few fixed categories, framing the HSD task as a rigid classification problem that does not fully align with medical practice and offers only limited information to physicians. Besides, such methods do not… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  7. arXiv:2408.02085  [pdf, other

    cs.CV cs.AI cs.CL eess.SP

    Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models

    Authors: Yulei Qin, Yuncheng Yang, Pengcheng Guo, Gang Li, Hang Shao, Yuchen Shi, Zihan Xu, Yun Gu, Ke Li, Xing Sun

    Abstract: Instruction tuning plays a critical role in aligning large language models (LLMs) with human preference. Despite the vast amount of open instruction datasets, naively training a LLM on all existing instructions may not be optimal and practical. To pinpoint the most beneficial datapoints, data assessment and selection methods have been proposed in the fields of natural language processing (NLP) and… ▽ More

    Submitted 7 August, 2024; v1 submitted 4 August, 2024; originally announced August 2024.

    Comments: review, survey, 28 pages, 2 figures, 4 tables

  8. arXiv:2407.11620  [pdf

    eess.SP

    A Deep Learning-Based Target Radial Length Estimation Method through HRRP Sequence

    Authors: Lingfeng Chen, Panhe Hu, Zhiliang Pan, Xiao Sun, Zehao Wang

    Abstract: This paper introduces an innovative deep learning-based method for end-to-end target radial length estimation from HRRP (High Resolution Range Profile) sequences. Firstly, the HRRP sequences are normalized and transformed into GAF (Gram Angular Field) images to effectively capture and utilize the temporal information. Subsequently, these GAF images serve as the input for a pretrained ResNet-101 mo… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 2 pages, 2 figures. Accepted by APCAP 2024

  9. arXiv:2407.08236  [pdf, other

    eess.SP

    HRRPGraphNet: A Graph Neural Network Based Approach for HRRP Radar Target Recognition

    Authors: Lingfeng Chen, Panhe Hu, Zhiliang Pan, Xiao Sun, Zehao Wang

    Abstract: High Resolution Range Profiles (HRRP) have become a key area of focus in the domain of Radar Automatic Target Recognition (RATR). Despite the success of data-driven neural network-based HRRP recognition, challenges such as insufficient training samples persist in its real-world application. This letter introduces HRRPGraphNet, a novel Graph Neural Network (GNN) model designed specifically for HRRP… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 5 pages, 4 figures

  10. arXiv:2407.04746  [pdf

    eess.SP

    Moving Target Detection Method Based on Range? Doppler Domain Compensation and Cancellation for UAV-Mounted Radar

    Authors: Xiaodong Qu, Xiaolong Sun, Feiyang Liu, Hao Zhang, Shichao Zhong, Xiaopeng Yang

    Abstract: Combining unmanned aerial vehicle (UAV) with through-the-wall radar can realize moving targets detection in complex building scenes. However, clutters generated by obstacles and static objects are always stronger and non-stationary, which results in heavy impacts on moving targets detection. To address this issue, this paper proposes a moving target detection method based on Range-Doppler domain c… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  11. arXiv:2406.08268  [pdf, other

    eess.SY

    Multi-Static ISAC based on Network-Assisted Full-Duplex Cell-Free Networks: Performance Analysis and Duplex Mode Optimization

    Authors: Fan Zeng, Ruoyun Liu, Xiaoyu Sun, Jingxuan Yu, Jiamin Li, Pengchen Zhu, Dongming Wang, Xiaohu You

    Abstract: Multi-static integrated sensing and communication (ISAC) technology, which can achieve a wider coverage range and avoid self-interference, is an important trend for the future development of ISAC. Existing multi-static ISAC designs are unable to support the asymmetric uplink (UL)/downlink (DL) communication requirements in the scenario while simultaneously achieving optimal sensing performance. Th… ▽ More

    Submitted 12 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  12. arXiv:2405.20068  [pdf, other

    eess.SP

    An Efficient Network with Novel Quantization Designed for Massive MIMO CSI Feedback

    Authors: Xinran Sun, Zhengming Zhang, Luxi Yang

    Abstract: The efficacy of massive multiple-input multiple-output (MIMO) techniques heavily relies on the accuracy of channel state information (CSI) in frequency division duplexing (FDD) systems. Many works focus on CSI compression and quantization methods to enhance CSI reconstruction accuracy with lower feedback overhead. In this letter, we propose CsiConformer, a novel CSI feedback network that combines… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  13. arXiv:2405.11163  [pdf, other

    cs.HC eess.SP

    Domain Generalization for Zero-calibration BCIs with Knowledge Distillation-based Phase Invariant Feature Extraction

    Authors: Zilin Liang, Zheng Zheng, Weihai Chen, Xinzhi Ma, Zhongcai Pei, Xiantao Sun

    Abstract: The distribution shift of electroencephalography (EEG) data causes poor generalization of braincomputer interfaces (BCIs) in unseen domains. Some methods try to tackle this challenge by collecting a portion of user data for calibration. However, it is time-consuming, mentally fatiguing, and user-unfriendly. To achieve zerocalibration BCIs, most studies employ domain generalization (DG) techniques… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  14. arXiv:2404.18105  [pdf, other

    cs.RO eess.SP

    Tightly-Coupled VLP/INS Integrated Navigation by Inclination Estimation and Blockage Handling

    Authors: Xiao Sun, Yuan Zhuang, Xiansheng Yang, Jianzhu Huai, Tianming Huang, Daquan Feng

    Abstract: Visible Light Positioning (VLP) has emerged as a promising technology capable of delivering indoor localization with high accuracy. In VLP systems that use Photodiodes (PDs) as light receivers, the Received Signal Strength (RSS) is affected by the incidence angle of light, making the inclination of PDs a critical parameter in the positioning model. Currently, most studies assume the inclination to… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  15. arXiv:2404.05911  [pdf, other

    eess.IV cs.CV

    LATUP-Net: A Lightweight 3D Attention U-Net with Parallel Convolutions for Brain Tumor Segmentation

    Authors: Ebtihal J. Alwadee, Xianfang Sun, Yipeng Qin, Frank C. Langbein

    Abstract: Early-stage 3D brain tumor segmentation from magnetic resonance imaging (MRI) scans is crucial for prompt and effective treatment. However, this process faces the challenge of precise delineation due to the tumors' complex heterogeneity. Moreover, energy sustainability targets and resource limitations, especially in developing countries, require efficient and accessible medical imaging solutions.… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  16. Stochastic-Robust Planning of Networked Hydrogen-Electrical Microgrids: A Study on Induced Refueling Demand

    Authors: Xunhang Sun, Xiaoyu Cao, Bo Zeng, Qiaozhu Zhai, Tamer Başar, Xiaohong Guan

    Abstract: Hydrogen-electrical microgrids are increasingly assuming an important role on the pathway toward decarbonization of energy and transportation systems. This paper studies networked hydrogen-electrical microgrids planning (NHEMP), considering a critical but often-overlooked issue, i.e., the demand-inducing effect (DIE) associated with infrastructure development decisions. Specifically, higher refuel… ▽ More

    Submitted 27 August, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Journal ref: IEEE Transactions on Smart Grid (2024)

  17. arXiv:2403.08442  [pdf, ps, other

    eess.SP

    Sensor Network Localization via Riemannian Conjugate Gradient and Rank Reduction: An Extended Version

    Authors: Yicheng Li, Xinghua Sun

    Abstract: This paper addresses the Sensor Network Localization (SNL) problem using received signal strength. The SNL is formulated as an Euclidean Distance Matrix Completion (EDMC) problem under the unit ball sample model. Using the Burer-Monteiro factorization type cost function, the EDMC is solved by Riemannian conjugate gradient with Hager-Zhang line search method on a quotient manifold. A "rank reductio… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  18. arXiv:2401.11677  [pdf, ps, other

    eess.SY

    Emulation-based Stabilization for Networked Control Systems with Stochastic Channels

    Authors: Wei Ren, Wei Wang, Zhuo-Rui Pan, Xi-Ming Sun, Andrew R. Teel, Dragan Nesic

    Abstract: This paper studies the stabilization problem of networked control systems (NCSs) with random packet dropouts caused by stochastic channels. To describe the effects of stochastic channels on the information transmission, the transmission times are assumed to be deterministic, whereas the packet transmission is assumed to be random. We first propose a stochastic scheduling protocol to model random p… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: 12 pages, 4 figures, accepted

  19. arXiv:2312.01479  [pdf, other

    cs.SD cs.LG eess.AS

    OpenVoice: Versatile Instant Voice Cloning

    Authors: Zengyi Qin, Wenliang Zhao, Xumin Yu, Xin Sun

    Abstract: We introduce OpenVoice, a versatile voice cloning approach that requires only a short audio clip from the reference speaker to replicate their voice and generate speech in multiple languages. OpenVoice represents a significant advancement in addressing the following open challenges in the field: 1) Flexible Voice Style Control. OpenVoice enables granular control over voice styles, including emotio… ▽ More

    Submitted 18 August, 2024; v1 submitted 3 December, 2023; originally announced December 2023.

    Comments: Technical Report

  20. arXiv:2312.00315  [pdf, ps, other

    eess.SY math.OC

    Multiple Control Functionals for Interconnected Time-Delay Systems

    Authors: Zhuo-Rui Pan, Wei Ren, Xi-Ming Sun

    Abstract: Safety is essential for autonomous systems, in particular for interconnected systems in which the interactions among subsystems are involved. Motivated by the recent interest in cyber-physical and interconnected autonomous systems, we address the safe stabilization problem of interconnected systems with time delays. We propose multiple control Lyapunov and barrier functionals for the stabilization… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: 6 pages, 2 figures

  21. arXiv:2311.16572   

    eess.SY physics.ao-ph physics.soc-ph

    Adapting to climate change: Long-term impact of wind resource changes on China's power system resilience

    Authors: Jiaqi Ruan, Xiangrui Meng, Yifan Zhu, Gaoqi Liang, Xianzhuo Sun, Huayi Wu, Huijuan Xiao, Mengqian Lu, Pin Gao, Jiapeng Li, Wai-Kin Wong, Zhao Xu, Junhua Zhao

    Abstract: Modern society's reliance on power systems is at risk from the escalating effects of wind-related climate change. Yet, failure to identify the intricate relationship between wind-related climate risks and power systems could lead to serious short- and long-term issues, including partial or complete blackouts. Here, we develop a comprehensive framework to assess China's power system resilience acro… ▽ More

    Submitted 24 January, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Not suitable for publication

  22. arXiv:2311.16378  [pdf, other

    cs.LG eess.SP

    Bayesian Formulations for Graph Spectral Denoising

    Authors: Sam Leone, Xingzhi Sun, Michael Perlmutter, Smita Krishnaswamy

    Abstract: Here we consider the problem of denoising features associated to complex data, modeled as signals on a graph, via a smoothness prior. This is motivated in part by settings such as single-cell RNA where the data is very high-dimensional, but its structure can be captured via an affinity graph. This allows us to utilize ideas from graph signal processing. In particular, we present algorithms for the… ▽ More

    Submitted 8 December, 2023; v1 submitted 27 November, 2023; originally announced November 2023.

  23. arXiv:2311.13361  [pdf, other

    cs.AI cs.HC eess.SY

    Applying Large Language Models to Power Systems: Potential Security Threats

    Authors: Jiaqi Ruan, Gaoqi Liang, Huan Zhao, Guolong Liu, Xianzhuo Sun, Jing Qiu, Zhao Xu, Fushuan Wen, Zhao Yang Dong

    Abstract: Applying large language models (LLMs) to modern power systems presents a promising avenue for enhancing decision-making and operational efficiency. However, this action may also incur potential security threats, which have not been fully recognized so far. To this end, this article analyzes potential threats incurred by applying LLMs to power systems, emphasizing the need for urgent research and d… ▽ More

    Submitted 24 January, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

  24. arXiv:2311.08880  [pdf, other

    cs.RO eess.SY

    Motion Control of Two Mobile Robots under Allowable Collisions

    Authors: Li Tan, Wei Ren, Xi-Ming Sun, Junlin Xiong

    Abstract: This letter investigates the motion control problem of two mobile robots under allowable collisions. Here, the allowable collisions mean that the collisions do not damage the mobile robots. The occurrence of the collisions is discussed and the effects of the collisions on the mobile robots are analyzed to develop a hybrid model of each mobile robot under allowable collisions. Based on the effects… ▽ More

    Submitted 26 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: 8 pages, 5 figures

  25. arXiv:2311.06604  [pdf, ps, other

    eess.SY

    Hub-Based Platoon Formation: Optimal Release Policies and Approximate Solutions

    Authors: Alexander Johansson, Ehsan Nekouei, Xiaotong Sun, Karl Henrik Johansson, Jonas M�rtensson

    Abstract: This paper studies the optimal hub-based platoon formation at hubs along a highway under decentralized, distributed, and centralized policies. Hubs are locations along highways where trucks can wait for other trucks to form platoons. A coordinator at each hub decides the departure time of trucks, and the released trucks from the hub will form platoons. The problem is cast as an optimization proble… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: Accepted for T-ITS 2023

  26. arXiv:2310.05021  [pdf, other

    eess.SY

    Toward Intelligent Emergency Control for Large-scale Power Systems: Convergence of Learning, Physics, Computing and Control

    Authors: Qiuhua Huang, Renke Huang, Tianzhixi Yin, Sohom Datta, Xueqing Sun, Jason Hou, Jie Tan, Wenhao Yu, Yuan Liu, Xinya Li, Bruce Palmer, Ang Li, Xinda Ke, Marianna Vaiman, Song Wang, Yousu Chen

    Abstract: This paper has delved into the pressing need for intelligent emergency control in large-scale power systems, which are experiencing significant transformations and are operating closer to their limits with more uncertainties. Learning-based control methods are promising and have shown effectiveness for intelligent power system control. However, when they are applied to large-scale power systems, t… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: submitted to PSCC 2024

  27. arXiv:2309.12611  [pdf, other

    cs.RO eess.SY

    On the Robotic Uncertainty of Fully Autonomous Traffic

    Authors: Hangyu Li, Xiaotong Sun

    Abstract: Recent transportation research suggests that autonomous vehicles (AVs) have the potential to improve traffic flow efficiency as they are able to maintain smaller car-following distances. Nevertheless, being a unique class of ground robots, AVs are susceptible to robotic errors, particularly in their perception module, leading to uncertainties in their movements and an increased risk of collisions.… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  28. arXiv:2309.09924  [pdf, other

    cs.LG eess.SP stat.ML

    Learning graph geometry and topology using dynamical systems based message-passing

    Authors: Dhananjay Bhaskar, Yanlei Zhang, Charles Xu, Xingzhi Sun, Oluwadamilola Fasina, Guy Wolf, Maximilian Nickel, Michael Perlmutter, Smita Krishnaswamy

    Abstract: In this paper we introduce DYMAG: a message passing paradigm for GNNs built on the expressive power of continuous, multiscale graph-dynamics. Standard discrete-time message passing algorithms implicitly make use of simplistic graph dynamics and aggregation schemes which limit their ability to capture fundamental graph topological properties. By contrast, DYMAG makes use of complex graph dynamics b… ▽ More

    Submitted 7 July, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

  29. arXiv:2309.08757  [pdf, other

    cs.LG eess.SP stat.AP stat.CO

    Circular Clustering with Polar Coordinate Reconstruction

    Authors: Xiaoxiao Sun, Paul Sajda

    Abstract: There is a growing interest in characterizing circular data found in biological systems. Such data are wide ranging and varied, from signal phase in neural recordings to nucleotide sequences in round genomes. Traditional clustering algorithms are often inadequate due to their limited ability to distinguish differences in the periodic component. Current clustering schemes that work in a polar coord… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: Manuscript is under review in IEEE Transactions on Computational Biology and Bioinformatics. Copyright holder is credited to IEEE

  30. Constrained CycleGAN for Effective Generation of Ultrasound Sector Images of Improved Spatial Resolution

    Authors: Xiaofei Sun, He Li, Wei-Ning Lee

    Abstract: Objective. A phased or a curvilinear array produces ultrasound (US) images with a sector field of view (FOV), which inherently exhibits spatially-varying image resolution with inferior quality in the far zone and towards the two sides azimuthally. Sector US images with improved spatial resolutions are favorable for accurate quantitative analysis of large and dynamic organs, such as the heart. Ther… ▽ More

    Submitted 2 September, 2023; originally announced September 2023.

    Journal ref: Physics in Medicine & Biology 2023

  31. arXiv:2308.02282  [pdf, other

    cs.LG cs.AI eess.SP

    DIVERSIFY: A General Framework for Time Series Out-of-distribution Detection and Generalization

    Authors: Wang Lu, Jindong Wang, Xinwei Sun, Yiqiang Chen, Xiangyang Ji, Qiang Yang, Xing Xie

    Abstract: Time series remains one of the most challenging modalities in machine learning research. The out-of-distribution (OOD) detection and generalization on time series tend to suffer due to its non-stationary property, i.e., the distribution changes over time. The dynamic distributions inside time series pose great challenges to existing algorithms to identify invariant distributions since they mainly… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: Journal version of arXiv:2209.07027; 17 pages

  32. arXiv:2307.10974  [pdf, other

    cs.NE cs.CV eess.IV

    Deep Multi-Threshold Spiking-UNet for Image Processing

    Authors: Hebei Li, Yueyi Zhang, Zhiwei Xiong, Xiaoyan Sun

    Abstract: U-Net, known for its simple yet efficient architecture, is widely utilized for image processing tasks and is particularly suitable for deployment on neuromorphic chips. This paper introduces the novel concept of Spiking-UNet for image processing, which combines the power of Spiking Neural Networks (SNNs) with the U-Net architecture. To achieve an efficient Spiking-UNet, we face two primary challen… ▽ More

    Submitted 11 April, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: Accepted in NeuroComputing

  33. arXiv:2306.15695  [pdf, other

    cs.SI cs.LG eess.SY

    Joint Learning of Network Topology and Opinion Dynamics Based on Bandit Algorithms

    Authors: Yu Xing, Xudong Sun, Karl H. Johansson

    Abstract: We study joint learning of network topology and a mixed opinion dynamics, in which agents may have different update rules. Such a model captures the diversity of real individual interactions. We propose a learning algorithm based on multi-armed bandit algorithms to address the problem. The goal of the algorithm is to find each agent's update rule from several candidate rules and to learn the under… ▽ More

    Submitted 25 June, 2023; originally announced June 2023.

  34. arXiv:2306.09116  [pdf, other

    eess.IV cs.CV

    Accurate Airway Tree Segmentation in CT Scans via Anatomy-aware Multi-class Segmentation and Topology-guided Iterative Learning

    Authors: Puyang Wang, Dazhou Guo, Dandan Zheng, Minghui Zhang, Haogang Yu, Xin Sun, Jia Ge, Yun Gu, Le Lu, Xianghua Ye, Dakai Jin

    Abstract: Intrathoracic airway segmentation in computed tomography (CT) is a prerequisite for various respiratory disease analyses such as chronic obstructive pulmonary disease (COPD), asthma and lung cancer. Unlike other organs with simpler shapes or topology, the airway's complex tree structure imposes an unbearable burden to generate the "ground truth" label (up to 7 or 3 hours of manual or semi-automati… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

  35. arXiv:2306.02886  [pdf

    eess.IV cs.CV cs.LG

    Image Reconstruction for Accelerated MR Scan with Faster Fourier Convolutional Neural Networks

    Authors: Xiaohan Liu, Yanwei Pang, Xuebin Sun, Yiming Liu, Yonghong Hou, Zhenchang Wang, Xuelong Li

    Abstract: Partial scan is a common approach to accelerate Magnetic Resonance Imaging (MRI) data acquisition in both 2D and 3D settings. However, accurately reconstructing images from partial scan data (i.e., incomplete k-space matrices) remains challenging due to lack of an effectively global receptive field in both spatial and k-space domains. To address this problem, we propose the following: (1) a novel… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

  36. TG-Critic: A Timbre-Guided Model for Reference-Independent Singing Evaluation

    Authors: Xiaoheng Sun, Yuejie Gao, Hanyao Lin, Huaping Liu

    Abstract: Automatic singing evaluation independent of reference melody is a challenging task due to its subjective and multi-dimensional nature. As an essential attribute of singing voices, vocal timbre has a non-negligible effect and influence on human perception of singing quality. However, no research has been done to include timbre information explicitly in singing evaluation models. In this paper, a da… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: The annotations for datasets used in this paper and further experimental results are available at https://github.com/YuejieGao/TG-CRITIC

  37. arXiv:2305.07816  [pdf, other

    eess.IV cs.CV

    PALM: Open Fundus Photograph Dataset with Pathologic Myopia Recognition and Anatomical Structure Annotation

    Authors: Huihui Fang, Fei Li, Junde Wu, Huazhu Fu, Xu Sun, José Ignacio Orlando, Hrvoje Bogunović, Xiulan Zhang, Yanwu Xu

    Abstract: Pathologic myopia (PM) is a common blinding retinal degeneration suffered by highly myopic population. Early screening of this condition can reduce the damage caused by the associated fundus lesions and therefore prevent vision loss. Automated diagnostic tools based on artificial intelligence methods can benefit this process by aiding clinicians to identify disease signs or to screen mass populati… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: 10 pages, 6 figures

  38. arXiv:2305.01319  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    Long-Term Rhythmic Video Soundtracker

    Authors: Jiashuo Yu, Yaohui Wang, Xinyuan Chen, Xiao Sun, Yu Qiao

    Abstract: We consider the problem of generating musical soundtracks in sync with rhythmic visual cues. Most existing works rely on pre-defined music representations, leading to the incompetence of generative flexibility and complexity. Other methods directly generating video-conditioned waveforms suffer from limited scenarios, short lengths, and unstable generation quality. To this end, we present Long-Term… ▽ More

    Submitted 30 May, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: ICML2023

    Report number: 15

  39. arXiv:2304.13471  [pdf, other

    eess.IV cs.CV

    OPDN: Omnidirectional Position-aware Deformable Network for Omnidirectional Image Super-Resolution

    Authors: Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Qiufang Ma, Xuhan Sheng, Ming Cheng, Haoyu Ma, Shijie Zhao, Jian Zhang, Junlin Li, Li Zhang

    Abstract: 360� omnidirectional images have gained research attention due to their immersive and interactive experience, particularly in AR/VR applications. However, they suffer from lower angular resolution due to being captured by fisheye lenses with the same sensor size for capturing planar images. To solve the above issues, we propose a two-stage framework for 360� omnidirectional image superresolution.… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: Accepted to CVPRW 2023

  40. arXiv:2304.08541  [pdf, other

    eess.AS cs.SD

    How Tiny Can Analog Filterbank Features Be Made for Ultra-low-power On-device Keyword Spotting?

    Authors: Subhajit Ray, Xinghua Sun, Nolan Tremelling, Maria Gordiyenko, Peter Kinget

    Abstract: Analog feature extraction is a power-efficient and re-emerging signal processing paradigm for implementing the front-end feature extractor in on device keyword-spotting systems. Despite its power efficiency and re-emergence, there is little consensus on what values the architectural parameters of its critical block, the analog filterbank, should be set to, even though they strongly influence power… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: Accepted as a full paper by the TinyML Research Symposium 2023

  41. arXiv:2303.11661  [pdf, other

    eess.IV cs.CV

    Advanced Multi-Microscopic Views Cell Semi-supervised Segmentation

    Authors: Fang Hu, Xuexue Sun, Ke Qing, Fenxi Xiao, Zhi Wang, Xiaolu Fan

    Abstract: Although deep learning (DL) shows powerful potential in cell segmentation tasks, it suffers from poor generalization as DL-based methods originally simplified cell segmentation in detecting cell membrane boundary, lacking prominent cellular structures to position overall differentiating. Moreover, the scarcity of annotated cell images limits the performance of DL models. Segmentation limitations o… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 23 pages

  42. Optimal scheduling of park-level integrated energy system considering ladder-type carbon trading mechanism and flexible load

    Authors: Hongbin Sun, Xinmei Sun, Lei Kou, Benfa Zhang, Xiaodan Zhu

    Abstract: In an attempt to improve the utilization efficiency of multi-energy coupling in park-level integrated energy system (PIES), promote wind power consumption and reduce carbon emissions, a low-carbon economic operation optimization model of PIES integrating flexible load and carbon trading mechanism is constructed. Firstly, according to the characteristics of load response, the demand response is div… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: accepted by Energy Reports

    MSC Class: 68T30 ACM Class: K.1

  43. arXiv:2302.08107  [pdf, other

    cs.IT eess.SP

    Spectral Efficiency and Scalability Analysis for Multi-Level Cooperative Cell-Free Massive MIMO Systems

    Authors: Jiamin Li, Xiaoyu Sun, Pengcheng Zhu, Dongming Wang, Xiaohu You

    Abstract: This paper proposes a multi-level cooperative architecture to balance the spectral efficiency and scalability of cell-free massive multiple-input multiple-output (MIMO) systems. In the proposed architecture, spatial expansion units (SEUs) are introduced to avoid a large amount of computation at the access points (APs) and increase the degree of cooperation among APs. We first derive the closed-for… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: 5 pages, 3 figures

  44. arXiv:2212.08525  [pdf, other

    cs.CR eess.SY

    Resource-Interaction Graph: Efficient Graph Representation for Anomaly Detection

    Authors: James Pope, Jinyuan Liang, Vijay Kumar, Francesco Raimondo, Xinyi Sun, Ryan McConville, Thomas Pasquier, Rob Piechocki, George Oikonomou, Bo Luo, Dan Howarth, Ioannis Mavromatis, Adrian Sanchez Mompo, Pietro Carnelli, Theodoros Spyridopoulos, Aftab Khan

    Abstract: Security research has concentrated on converting operating system audit logs into suitable graphs, such as provenance graphs, for analysis. However, provenance graphs can grow very large requiring significant computational resources beyond what is necessary for many security tasks and are not feasible for resource constrained environments, such as edge devices. To address this problem, we present… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

    Comments: 15 pages, 11 figures, 6 tables, for dataset see https://github.com/jpope8/container-escape-dataset, for code see https://github.com/jpope8/container-escape-analysis

  45. Safe Stabilization for Stochastic Time-Delay Systems

    Authors: Zhuo-Rui Pan, Wei Ren, Xi-Ming Sun

    Abstract: This paper addresses the safe stabilization problem of stochastic nonlinear time-delay systems. Based on theKrasovskii approach, we first propose a stochastic control Lyapunov-Krasovskii functional to guarantee the stabilization objective and a stochastic control barrier-Krasovskii functional to ensure the safety objective. Both functionals are developed respectively for each control objectives fo… ▽ More

    Submitted 3 November, 2023; v1 submitted 20 November, 2022; originally announced November 2022.

    Comments: 7 pages, 8 figures. Accepted by IEEE TAC as a Technical Note. arXiv admin note: text overlap with arXiv:2204.12106

  46. arXiv:2211.05256  [pdf, other

    eess.IV cs.CV

    Power Efficient Video Super-Resolution on Mobile NPUs with Deep Learning, Mobile AI & AIM 2022 challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Cheng-Ming Chiang, Hsien-Kai Kuo, Yu-Syuan Xu, Man-Yu Lee, Allen Lu, Chia-Ming Cheng, Chih-Cheng Chen, Jia-Ying Yong, Hong-Han Shuai, Wen-Huang Cheng, Zhuang Jia, Tianyu Xu, Yijian Zhang, Long Bao, Heng Sun, Diankai Zhang, Si Gao, Shaoli Liu, Biao Wu, Xiaofeng Zhang, Chengjian Zheng, Kaidi Lu, Ning Wang , et al. (29 additional authors not shown)

    Abstract: Video super-resolution is one of the most popular tasks on mobile devices, being widely used for an automatic improvement of low-bitrate and low-resolution video streams. While numerous solutions have been proposed for this problem, they are usually quite computationally demanding, demonstrating low FPS rates and power efficiency on mobile devices. In this Mobile AI challenge, we address this prob… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2105.08826, arXiv:2105.07809, arXiv:2211.04470, arXiv:2211.03885

  47. arXiv:2211.03577  [pdf

    physics.optics eess.SP physics.app-ph

    Regrowth-free AlGaInAs MQW polarization controller integrated with sidewall grating DFB laser

    Authors: Xiao Sun, Song Liang, Weiqing Cheng, Shengwei Ye, Yiming Sun, Yongguang Huang, Ruikang Zhang, Jichuan Xiong, Xuefeng Liu, John H. Marsh, Lianping Hou

    Abstract: We report an AlGaInAs multiple quantum well integrated source of polarization controlled light consisting of a polarization mode converter PMC, differential phase shifter(DPS), and a side wall grating distributed-feedback DFB laser. We demonstrate an asymmetrical stepped-height ridge waveguide PMC to realize TE to TM polarization conversion and a symmetrical straight waveguide DPS to enable polari… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2210.10519

  48. arXiv:2210.05180  [pdf, other

    cs.RO eess.SY

    Neurosymbolic Motion and Task Planning for Linear Temporal Logic Tasks

    Authors: Xiaowu Sun, Yasser Shoukry

    Abstract: This paper presents a neurosymbolic framework to solve motion planning problems for mobile robots involving temporal goals. The temporal goals are described using temporal logic formulas such as Linear Temporal Logic (LTL) to capture complex tasks. The proposed framework trains Neural Network (NN)-based planners that enjoy strong correctness guarantees when applying to unseen tasks, i.e., the exac… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  49. arXiv:2210.01476  [pdf, ps, other

    math.OC cs.LG eess.SY math.DS

    Learning-based Design of Luenberger Observers for Autonomous Nonlinear Systems

    Authors: Muhammad Umar B. Niazi, John Cao, Xudong Sun, Amritam Das, Karl Henrik Johansson

    Abstract: Designing Luenberger observers for nonlinear systems involves the challenging task of transforming the state to an alternate coordinate system, possibly of higher dimensions, where the system is asymptotically stable and linear up to output injection. The observer then estimates the system's state in the original coordinates by inverting the transformation map. However, finding a suitable injectiv… ▽ More

    Submitted 5 April, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: Proceedings of the 2023 American Control Conference (ACC)

  50. arXiv:2208.13970  [pdf, other

    cs.IT eess.SY

    Joint Resource Allocation and Configuration Design for STAR-RIS-Enhanced Wireless-Powered MEC

    Authors: Xintong Qin, Zhengyu Song, Tianwei Hou, Wenjuan Yu, Jun Wang, Xin Sun

    Abstract: In this paper, a novel concept called simultaneously transmitting and reflecting RIS (STAR-RIS) is introduced into the wireless-powered mobile edge computing (MEC) systems to improve the efficiency of energy transfer and task offloading. Compared with traditional reflecting-only RIS, STAR-RIS extends the half-space coverage to full-space coverage by simultaneously transmitting and reflecting incid… ▽ More

    Submitted 29 August, 2022; originally announced August 2022.