-
Enhancing Large Language Models' Situated Faithfulness to External Contexts
Authors:
Yukun Huang,
Sanxing Chen,
Hongyi Cai,
Bhuwan Dhingra
Abstract:
Large Language Models (LLMs) are often augmented with external information as contexts, but this external information can sometimes be inaccurate or even intentionally misleading. We argue that robust LLMs should demonstrate situated faithfulness, dynamically calibrating their trust in external information based on their confidence in the internal knowledge and the external context. To benchmark t…
▽ More
Large Language Models (LLMs) are often augmented with external information as contexts, but this external information can sometimes be inaccurate or even intentionally misleading. We argue that robust LLMs should demonstrate situated faithfulness, dynamically calibrating their trust in external information based on their confidence in the internal knowledge and the external context. To benchmark this capability, we evaluate LLMs across several QA datasets, including a newly created dataset called RedditQA featuring in-the-wild incorrect contexts sourced from Reddit posts. We show that when provided with both correct and incorrect contexts, both open-source and proprietary models tend to overly rely on external information, regardless of its factual accuracy. To enhance situated faithfulness, we propose two approaches: Self-Guided Confidence Reasoning (SCR) and Rule-Based Confidence Reasoning (RCR). SCR enables models to self-access the confidence of external information relative to their own internal knowledge to produce the most accurate answer. RCR, in contrast, extracts explicit confidence signals from the LLM and determines the final answer using predefined rules. Our results show that for LLMs with strong reasoning capabilities, such as GPT-4o and GPT-4o mini, SCR outperforms RCR, achieving improvements of up to 24.2% over a direct input augmentation baseline. Conversely, for a smaller model like Llama-3-8B, RCR outperforms SCR. Fine-tuning SCR with our proposed Confidence Reasoning Direct Preference Optimization (CR-DPO) method improves performance on both seen and unseen datasets, yielding an average improvement of 8.9% on Llama-3-8B. In addition to quantitative results, we offer insights into the relative strengths of SCR and RCR. Our findings highlight promising avenues for improving situated faithfulness in LLMs. The data and code are released.
△ Less
Submitted 18 October, 2024;
originally announced October 2024.
-
Test of lepton flavour universality with $B_s^0 \rightarrow φ\ell^+\ell^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1124 additional authors not shown)
Abstract:
Lepton flavour universality in rare $b\rightarrow s$ transitions is tested for the first time using $B_s^0$ meson decays. The measurements are performed using $pp$ collision data collected by the LHCb experiment between 2011 and 2018, corresponding to a total integrated luminosity of 9$\,{\rm fb}^{-1}$. Branching fraction ratios between the $B_s^0 \rightarrow φe^+e^-$ and…
▽ More
Lepton flavour universality in rare $b\rightarrow s$ transitions is tested for the first time using $B_s^0$ meson decays. The measurements are performed using $pp$ collision data collected by the LHCb experiment between 2011 and 2018, corresponding to a total integrated luminosity of 9$\,{\rm fb}^{-1}$. Branching fraction ratios between the $B_s^0 \rightarrow φe^+e^-$ and $B_s^0 \rightarrow φμ^+μ^-$ decays are measured in three regions of dilepton mass squared, $q^2$, with $0.1 < q^2 < 1.1$, $1.1 < q^2 < 6.0$, and $15 < q^2 < 19\,{\rm GeV}^2/c^4$. The results agree with the Standard Model expectation of lepton flavour universality.
△ Less
Submitted 17 October, 2024;
originally announced October 2024.
-
Observation of a rare beta decay of the charmed baryon with a Graph Neural Network
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (637 additional authors not shown)
Abstract:
The study of beta decay of the charmed baryon provides unique insights into the fundamental mechanism of the strong and electro-weak interactions. The $Λ_c^+$, being the lightest charmed baryon, undergoes disintegration solely through the charm quark weak decay. Its beta decay provides an ideal laboratory for investigating non-perturbative effects in quantum chromodynamics and for constraining the…
▽ More
The study of beta decay of the charmed baryon provides unique insights into the fundamental mechanism of the strong and electro-weak interactions. The $Λ_c^+$, being the lightest charmed baryon, undergoes disintegration solely through the charm quark weak decay. Its beta decay provides an ideal laboratory for investigating non-perturbative effects in quantum chromodynamics and for constraining the fundamental parameters of the Cabibbo-Kobayashi-Maskawa matrix in weak interaction theory. This article presents the first observation of the Cabibbo-suppressed $Λ_c^+$ beta decay into a neutron $Λ_c^+ \rightarrow n e^+ ν_{e}$, based on $4.5~\mathrm{fb}^{-1}$ of electron-positron annihilation data collected with the BESIII detector in the energy region above the $Λ^+_c\barΛ^-_c$ threshold. A novel machine learning technique, leveraging Graph Neural Networks, has been utilized to effectively separate signals from dominant backgrounds, particularly $Λ_c^+ \rightarrow Λe^+ ν_{e}$. This approach has yielded a statistical significance of more than $10σ$. The absolute branching fraction of $Λ_c^+ \rightarrow n e^+ ν_{e}$ is measured to be $(3.57\pm0.34_{\mathrm{stat}}\pm0.14_{\mathrm{syst}})\times 10^{-3}$. For the first time, the CKM matrix element $\left|V_{cd}\right|$ is extracted via a charmed baryon decay to be $0.208\pm0.011_{\rm exp.}\pm0.007_{\rm LQCD}\pm0.001_{τ_{Λ_c^+}}$. This study provides a new probe to further understand fundamental interactions in the charmed baryon sector, and demonstrates the power of modern machine learning techniques in enhancing experimental capability in high energy physics research.
△ Less
Submitted 17 October, 2024;
originally announced October 2024.
-
Observation of $χ_{c0}\toΣ^{+}\barΣ^{-}η$ and evidence for $χ_{c1,2}\toΣ^{+}\barΣ^{-}η$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (634 additional authors not shown)
Abstract:
Using $(27.12\pm 0.14)\times10^{8}$ $ψ(3686)$ events collected with the BESIII detector, the decay $χ_{c0}\toΣ^{+}\barΣ^{-}η$ is observed for the first time with a statistical significance of $7.0σ$, and evidence for $χ_{c1}\toΣ^{+}\barΣ^{-}η$ and $χ_{c2}\toΣ^{+}\barΣ^{-}η$ is found with statistical significances of $4.3σ$ and $4.6σ$, respectively. The branching fractions are determined to be…
▽ More
Using $(27.12\pm 0.14)\times10^{8}$ $ψ(3686)$ events collected with the BESIII detector, the decay $χ_{c0}\toΣ^{+}\barΣ^{-}η$ is observed for the first time with a statistical significance of $7.0σ$, and evidence for $χ_{c1}\toΣ^{+}\barΣ^{-}η$ and $χ_{c2}\toΣ^{+}\barΣ^{-}η$ is found with statistical significances of $4.3σ$ and $4.6σ$, respectively. The branching fractions are determined to be $\mathcal{B}(χ_{c0}\toΣ^{+}\barΣ^{-}η)=({1.26 \pm 0.20 \pm 0.13}) \times 10^{-4}, ~\mathcal{B}(χ_{c1}\toΣ^{+}\barΣ^{-}η)=({5.10 \pm 1.21 \pm 0.67}) \times 10^{-5}$, and $\mathcal{B}(χ_{c2}\toΣ^{+}\barΣ^{-}η)=({5.46 \pm 1.18 \pm 0.50}) \times 10^{-5}$, where the first uncertainties are statistical, and the second ones are systematic.
△ Less
Submitted 17 October, 2024;
originally announced October 2024.
-
Observation of the Singly Cabibbo-Suppressed Decay $Λ_c^{+}\to pπ^0$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (638 additional authors not shown)
Abstract:
Utilizing 4.5${~\rm{fb}}^{-1}$ of $e^+e^-$ annihilation data collected with the BESIII detector at the BEPCII collider at center-of-mass energies between 4.600 and 4.699 GeV, the first observation of the singly Cabibbo-suppressed decay $Λ_c^{+}\to pπ^0$ is presented, with a statistical significance of $5.4σ$. The ratio of the branching fractions of $Λ_c^{+}\to pπ^0$ and $Λ_c^{+}\to pη$ is measured…
▽ More
Utilizing 4.5${~\rm{fb}}^{-1}$ of $e^+e^-$ annihilation data collected with the BESIII detector at the BEPCII collider at center-of-mass energies between 4.600 and 4.699 GeV, the first observation of the singly Cabibbo-suppressed decay $Λ_c^{+}\to pπ^0$ is presented, with a statistical significance of $5.4σ$. The ratio of the branching fractions of $Λ_c^{+}\to pπ^0$ and $Λ_c^{+}\to pη$ is measured as $\mathcal{B}(Λ_c^{+}\to pπ^0)/\mathcal{B}(Λ_c^{+}\to pη)=(0.120\pm0.026_{\rm stat.}\pm0.007_{\rm syst.})$. This result resolves the longstanding discrepancy between earlier experimental searches, providing both a decisive conclusion and valuable input for QCD-inspired theoretical models. A sophisticated deep learning approach using a Transformer-based architecture is employed to distinguish the signal from the prevalent hadronic backgrounds, complemented by thorough validation and systematic uncertainty quantification.
△ Less
Submitted 17 October, 2024;
originally announced October 2024.
-
AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning
Authors:
Hao Sun,
Jiayi Wu,
Hengyi Cai,
Xiaochi Wei,
Yue Feng,
Bo Wang,
Shuaiqiang Wang,
Yan Zhang,
Dawei Yin
Abstract:
Recent advancements in large language models (LLMs) have been remarkable. Users face a choice between using cloud-based LLMs for generation quality and deploying local-based LLMs for lower computational cost. The former option is typically costly and inefficient, while the latter usually fails to deliver satisfactory performance for reasoning steps requiring deliberate thought processes. In this w…
▽ More
Recent advancements in large language models (LLMs) have been remarkable. Users face a choice between using cloud-based LLMs for generation quality and deploying local-based LLMs for lower computational cost. The former option is typically costly and inefficient, while the latter usually fails to deliver satisfactory performance for reasoning steps requiring deliberate thought processes. In this work, we propose a novel LLM utilization paradigm that facilitates the collaborative operation of large cloud-based LLMs and smaller local-deployed LLMs. Our framework comprises two primary modules: the local agent instantiated with a relatively smaller LLM, handling less complex reasoning steps, and the cloud agent equipped with a larger LLM, managing more intricate reasoning steps. This collaborative processing is enabled through an adaptive mechanism where the local agent introspectively identifies errors and proactively seeks assistance from the cloud agent, thereby effectively integrating the strengths of both locally-deployed and cloud-based LLMs, resulting in significant enhancements in task completion performance and efficiency. We evaluate AdaSwitch across 7 benchmarks, ranging from mathematical reasoning and complex question answering, using various types of LLMs to instantiate the local and cloud agents. The empirical results show that AdaSwitch effectively improves the performance of the local agent, and sometimes achieves competitive results compared to the cloud agent while utilizing much less computational overhead.
△ Less
Submitted 16 October, 2024;
originally announced October 2024.
-
Search for $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ at center-of-mass energies from 4.47 to 4.95 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (644 additional authors not shown)
Abstract:
Utilizing a data set of $6.7$ fb$^{-1}$ from electron-positron collisions recorded by the BESIII detector at the BEPCII storage ring, a search is conducted for the processes $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ across center-of-mass energies from 4.47 to 4.95 GeV. In the absence of any significant signals, upper limits are set. These include limits on the Born cross sections for…
▽ More
Utilizing a data set of $6.7$ fb$^{-1}$ from electron-positron collisions recorded by the BESIII detector at the BEPCII storage ring, a search is conducted for the processes $e^{+}e^{-} \to φχ_{c0}$ and $φη_{c2}(1D)$ across center-of-mass energies from 4.47 to 4.95 GeV. In the absence of any significant signals, upper limits are set. These include limits on the Born cross sections for $e^{+}e^{-} \to φχ_{c0}$, as well as the product of the Born cross section for $e^{+}e^{-} \to φη_{c2}(1D)$ and a sum of five branching fractions. Furthermore, the product of the electronic width of $Y(4660)$ and the branching fraction of the $Y(4660) \to φχ_{c0}$, denoted as $Γ^{Y(4660)}_{e^{+}e^{-}} \mathcal{B}_{Y(4660) \to φχ_{c0}}$, is determined to be $< 0.40$ eV at the 90\% confidence level.
△ Less
Submitted 16 October, 2024;
originally announced October 2024.
-
Efficient generation of divergent and collimated hot electrons via a novel multi-beam two-plasmon decay and stimulated Raman scattering mechanism
Authors:
K. Y. Meng,
Z. H. Cai,
J. Li,
C. Yao,
L. Hao,
F. X. Zhou,
R. Yan,
J. Zheng
Abstract:
In inertial confinement fusion (ICF) implosions, the preheating risks associated with hot electrons generated by laser plasma instabilities (LPI) are contingent upon the angular characteristics of these hot electrons for a given total energy. Using particle-in-cell simulations, we reveal a novel multi-beam collaborative mechanism of two-plasmon decay (TPD) and stimulated Raman scattering (SRS), an…
▽ More
In inertial confinement fusion (ICF) implosions, the preheating risks associated with hot electrons generated by laser plasma instabilities (LPI) are contingent upon the angular characteristics of these hot electrons for a given total energy. Using particle-in-cell simulations, we reveal a novel multi-beam collaborative mechanism of two-plasmon decay (TPD) and stimulated Raman scattering (SRS), and investigate the angular variations of hot electrons generated from this TPD-SRS instability (TSI) driven collectively by dual laser beams with varying incident angles $θ_{in}$ ($24^{\circ}$ to $55^{\circ}$ at the incident plane) for typical ICF conditions. In the simulations with $θ_{in}\gtrsim44^{\circ}$, TSI emerges as the dominant mechanism responsible for hot electron generation, leading to a wide angular distribution of hot electrons that exhibit both pronounced divergent and collimated components. The common Langmuir wave associated with TSI plays a crucial role in accelerating both components. By properly modeling the TSI common wave gains, we establish scaling relations between these gains and the energies of collimated and divergent hot electrons. These relations reveal that the divergent hot electrons are more sensitive to variations in gain compared to the collimated electrons. Additionally, the calculated gains qualitatively predict the asymmetry in hot electron angular distributions when the density gradients deviate from the bisector of the laser beams. Our findings offers insights for hot electron generation with multiple beams, potentially complementing previous experiments that underscore the critical role of overlapped intensity from symmetric beams within the same cone and the dominance of dual-beam coupling.
△ Less
Submitted 16 October, 2024;
originally announced October 2024.
-
Observation of $χ_{cJ}\to p \bar p K^0_S K^- π^+ + c.c.$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (648 additional authors not shown)
Abstract:
By analyzing $(27.12\pm0.14)\times10^8$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decays of $χ_{cJ} \to p \bar{p} K^0_S K^- π^+ +c.c.(J=0, 1, 2)$ are observed for the first time with statistical significances greater than $10σ$. The branching fractions of these decays are determined to be…
▽ More
By analyzing $(27.12\pm0.14)\times10^8$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decays of $χ_{cJ} \to p \bar{p} K^0_S K^- π^+ +c.c.(J=0, 1, 2)$ are observed for the first time with statistical significances greater than $10σ$. The branching fractions of these decays are determined to be $\mathcal{B}(χ_{c0}\to p \bar p K^{0}_{S} K^- π^+ + c.c.)=(2.61\pm0.27\pm0.32)\times10^{-5},$ $\mathcal{B}(χ_{c1}\to p \bar p K^{0}_{S} K^- π^+ + c.c.)=(4.16\pm0.24\pm0.46)\times10^{-5},$ and $\mathcal{B}(χ_{c2}\to p \bar p K^{0}_{S} K^- π^+ + c.c.)=(5.63\pm0.28\pm0.46)\times10^{-5}$, respectively. The processes $χ_{c1,2} \to \bar{p} Λ(1520) K^0_S π^{+} + c.c.$ are also observed, with statistical significances of 5.7$σ$ and 7.0$σ$, respectively. Evidence for $χ_{c0} \to\bar{p} Λ(1520) K^0_S π^{+} + c.c.$ is found with statistical significances of 3.3$σ$ each. The corresponding branching fractions are determined to be $\mathcal{B}(χ_{c0}\to \bar{p} Λ(1520) K^0_S π^{+} + c.c.) =(1.61^{+0.68}_{-0.64}\pm0.23)\times10^{-5}$, $\mathcal{B}(χ_{c1}\to \bar{p} Λ(1520) K^0_S π^{+} + c.c.)=(4.06^{+0.80}_{-0.76}\pm0.52)\times10^{-5}$, and $\mathcal{B}(χ_{c2}\to \bar{p} Λ(1520) K^0_S π^{+} + c.c.)=(4.09^{+0.87}_{-0.84}\pm0.42)\times10^{-5}$. Here, the first uncertainties are statistical and the second ones are systematic.
△ Less
Submitted 15 October, 2024;
originally announced October 2024.
-
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Authors:
Haotian Tang,
Yecheng Wu,
Shang Yang,
Enze Xie,
Junsong Chen,
Junyu Chen,
Zhuoyang Zhang,
Han Cai,
Yao Lu,
Song Han
Abstract:
We introduce Hybrid Autoregressive Transformer (HART), an autoregressive (AR) visual generation model capable of directly generating 1024x1024 images, rivaling diffusion models in image generation quality. Existing AR models face limitations due to the poor image reconstruction quality of their discrete tokenizers and the prohibitive training costs associated with generating 1024px images. To addr…
▽ More
We introduce Hybrid Autoregressive Transformer (HART), an autoregressive (AR) visual generation model capable of directly generating 1024x1024 images, rivaling diffusion models in image generation quality. Existing AR models face limitations due to the poor image reconstruction quality of their discrete tokenizers and the prohibitive training costs associated with generating 1024px images. To address these challenges, we present the hybrid tokenizer, which decomposes the continuous latents from the autoencoder into two components: discrete tokens representing the big picture and continuous tokens representing the residual components that cannot be represented by the discrete tokens. The discrete component is modeled by a scalable-resolution discrete AR model, while the continuous component is learned with a lightweight residual diffusion module with only 37M parameters. Compared with the discrete-only VAR tokenizer, our hybrid approach improves reconstruction FID from 2.11 to 0.30 on MJHQ-30K, leading to a 31% generation FID improvement from 7.85 to 5.38. HART also outperforms state-of-the-art diffusion models in both FID and CLIP score, with 4.5-7.7x higher throughput and 6.9-13.4x lower MACs. Our code is open sourced at https://github.com/mit-han-lab/hart.
△ Less
Submitted 14 October, 2024;
originally announced October 2024.
-
Embedding Self-Correction as an Inherent Ability in Large Language Models for Enhanced Mathematical Reasoning
Authors:
Kuofeng Gao,
Huanqia Cai,
Qingyao Shuai,
Dihong Gong,
Zhifeng Li
Abstract:
Accurate mathematical reasoning with Large Language Models (LLMs) is crucial in revolutionizing domains that heavily rely on such reasoning. However, LLMs often encounter difficulties in certain aspects of mathematical reasoning, leading to flawed reasoning and erroneous results. To mitigate these issues, we introduce a novel mechanism, the Chain of Self-Correction (CoSC), specifically designed to…
▽ More
Accurate mathematical reasoning with Large Language Models (LLMs) is crucial in revolutionizing domains that heavily rely on such reasoning. However, LLMs often encounter difficulties in certain aspects of mathematical reasoning, leading to flawed reasoning and erroneous results. To mitigate these issues, we introduce a novel mechanism, the Chain of Self-Correction (CoSC), specifically designed to embed self-correction as an inherent ability in LLMs, enabling them to validate and rectify their own results. The CoSC mechanism operates through a sequence of self-correction stages. In each stage, the LLMs generate a program to address a given problem, execute this program using program-based tools to obtain an output, subsequently verify this output. Based on the verification, the LLMs either proceed to the next correction stage or finalize the answer. This iterative self-correction process allows the LLMs to refine their reasoning steps and improve the accuracy of their mathematical reasoning. To enable the CoSC mechanism at a low cost, we employ a two-phase finetuning approach. In the first phase, the LLMs are trained with a relatively small volume of seeding data generated from GPT-4, establishing an initial CoSC capability. In the second phase, the CoSC capability is further enhanced by training with a larger volume of self-generated data using the trained model in the first phase, without relying on the paid GPT-4. Our comprehensive experiments demonstrate that CoSC significantly improves performance on traditional mathematical datasets among existing open-source LLMs. Notably, our CoSC-Code-34B model achieved a 53.5% score on MATH, the most challenging mathematical reasoning dataset in the public domain, surpassing the performance of well-established models such as ChatGPT, GPT-4, and even multi-modal LLMs like GPT-4V, Gemini-1.0 Pro, and Gemini-1.0 Ultra.
△ Less
Submitted 14 October, 2024;
originally announced October 2024.
-
Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models
Authors:
Junyu Chen,
Han Cai,
Junsong Chen,
Enze Xie,
Shang Yang,
Haotian Tang,
Muyang Li,
Yao Lu,
Song Han
Abstract:
We present Deep Compression Autoencoder (DC-AE), a new family of autoencoder models for accelerating high-resolution diffusion models. Existing autoencoder models have demonstrated impressive results at a moderate spatial compression ratio (e.g., 8x), but fail to maintain satisfactory reconstruction accuracy for high spatial compression ratios (e.g., 64x). We address this challenge by introducing…
▽ More
We present Deep Compression Autoencoder (DC-AE), a new family of autoencoder models for accelerating high-resolution diffusion models. Existing autoencoder models have demonstrated impressive results at a moderate spatial compression ratio (e.g., 8x), but fail to maintain satisfactory reconstruction accuracy for high spatial compression ratios (e.g., 64x). We address this challenge by introducing two key techniques: (1) Residual Autoencoding, where we design our models to learn residuals based on the space-to-channel transformed features to alleviate the optimization difficulty of high spatial-compression autoencoders; (2) Decoupled High-Resolution Adaptation, an efficient decoupled three-phases training strategy for mitigating the generalization penalty of high spatial-compression autoencoders. With these designs, we improve the autoencoder's spatial compression ratio up to 128 while maintaining the reconstruction quality. Applying our DC-AE to latent diffusion models, we achieve significant speedup without accuracy drop. For example, on ImageNet 512x512, our DC-AE provides 19.1x inference speedup and 17.9x training speedup on H100 GPU for UViT-H while achieving a better FID, compared with the widely used SD-VAE-f8 autoencoder. Our code is available at https://github.com/mit-han-lab/efficientvit.
△ Less
Submitted 14 October, 2024;
originally announced October 2024.
-
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers
Authors:
Enze Xie,
Junsong Chen,
Junyu Chen,
Han Cai,
Haotian Tang,
Yujun Lin,
Zhekai Zhang,
Muyang Li,
Ligeng Zhu,
Yao Lu,
Song Han
Abstract:
We introduce Sana, a text-to-image framework that can efficiently generate images up to 4096$\times$4096 resolution. Sana can synthesize high-resolution, high-quality images with strong text-image alignment at a remarkably fast speed, deployable on laptop GPU. Core designs include: (1) Deep compression autoencoder: unlike traditional AEs, which compress images only 8$\times$, we trained an AE that…
▽ More
We introduce Sana, a text-to-image framework that can efficiently generate images up to 4096$\times$4096 resolution. Sana can synthesize high-resolution, high-quality images with strong text-image alignment at a remarkably fast speed, deployable on laptop GPU. Core designs include: (1) Deep compression autoencoder: unlike traditional AEs, which compress images only 8$\times$, we trained an AE that can compress images 32$\times$, effectively reducing the number of latent tokens. (2) Linear DiT: we replace all vanilla attention in DiT with linear attention, which is more efficient at high resolutions without sacrificing quality. (3) Decoder-only text encoder: we replaced T5 with modern decoder-only small LLM as the text encoder and designed complex human instruction with in-context learning to enhance the image-text alignment. (4) Efficient training and sampling: we propose Flow-DPM-Solver to reduce sampling steps, with efficient caption labeling and selection to accelerate convergence. As a result, Sana-0.6B is very competitive with modern giant diffusion model (e.g. Flux-12B), being 20 times smaller and 100+ times faster in measured throughput. Moreover, Sana-0.6B can be deployed on a 16GB laptop GPU, taking less than 1 second to generate a 1024$\times$1024 resolution image. Sana enables content creation at low cost. Code and model will be publicly released.
△ Less
Submitted 15 October, 2024; v1 submitted 14 October, 2024;
originally announced October 2024.
-
Observation of $D^+\toη^\primeμ^+ν_μ$ and First Study of $D^+\to η^\prime \ell^+ν_\ell$ Decay Dynamics
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (643 additional authors not shown)
Abstract:
Using $20.3\,\rm fb^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy 3.773\,GeV with the BESIII detector, we report the first observation of the semileptonic decay $D^+\to η^\prime μ^+ν_μ$ with significance of $8.6σ$ including systematic uncertainties, and an improved measurement of $D^+\to η^\prime e^+ν_e$. The branching fractions of $D^+\to η^\prime μ^+ν_μ$ and…
▽ More
Using $20.3\,\rm fb^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy 3.773\,GeV with the BESIII detector, we report the first observation of the semileptonic decay $D^+\to η^\prime μ^+ν_μ$ with significance of $8.6σ$ including systematic uncertainties, and an improved measurement of $D^+\to η^\prime e^+ν_e$. The branching fractions of $D^+\to η^\prime μ^+ν_μ$ and $D^+\to η^\prime e^+ν_e$ are determined to be $(1.92\pm0.28_{\rm stat}\pm 0.08_{\rm syst})\times 10^{-4}$ and $(1.79\pm0.19_{\rm stat}\pm 0.07_{\rm syst})\times 10^{-4}$, respectively. From an analysis of the $D^+\to η^\prime \ell^+ν_\ell$ decay dynamics, the product of the hadronic form factor $f_+^{η^{\prime}}(0)$ and the CKM matrix element $|V_{cd}|$ is measured for the first time, giving $f^{η^\prime}_+(0)|V_{cd}| = (5.92\pm0.56_{\rm stat}\pm0.13_{\rm syst})\times 10^{-2}$. No evidence for violation of $μ-e$ lepton-flavor universality is found in both the full range and several bins of $\ell^+ν_\ell$ four-momentum transfer. The $η-η^\prime$ mixing angle in the quark flavor basis is determined to be $φ_{\rm P} =(39.8\pm0.8_{\rm stat}\pm0.3_{\rm syst})^\circ$.
△ Less
Submitted 11 October, 2024;
originally announced October 2024.
-
From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions
Authors:
Changle Qu,
Sunhao Dai,
Xiaochi Wei,
Hengyi Cai,
Shuaiqiang Wang,
Dawei Yin,
Jun Xu,
Ji-Rong Wen
Abstract:
Tool learning enables Large Language Models (LLMs) to interact with external environments by invoking tools, serving as an effective strategy to mitigate the limitations inherent in their pre-training data. In this process, tool documentation plays a crucial role by providing usage instructions for LLMs, thereby facilitating effective tool utilization. This paper concentrates on the critical chall…
▽ More
Tool learning enables Large Language Models (LLMs) to interact with external environments by invoking tools, serving as an effective strategy to mitigate the limitations inherent in their pre-training data. In this process, tool documentation plays a crucial role by providing usage instructions for LLMs, thereby facilitating effective tool utilization. This paper concentrates on the critical challenge of bridging the comprehension gap between LLMs and external tools due to the inadequacies and inaccuracies inherent in existing human-centric tool documentation. We propose a novel framework, DRAFT, aimed at Dynamically Refining tool documentation through the Analysis of Feedback and Trails emanating from LLMs' interactions with external tools. This methodology pivots on an innovative trial-and-error approach, consisting of three distinct learning phases: experience gathering, learning from experience, and documentation rewriting, to iteratively enhance the tool documentation. This process is further optimized by implementing a diversity-promoting exploration strategy to ensure explorative diversity and a tool-adaptive termination mechanism to prevent overfitting while enhancing efficiency. Extensive experiments on multiple datasets demonstrate that DRAFT's iterative, feedback-based refinement significantly ameliorates documentation quality, fostering a deeper comprehension and more effective utilization of tools by LLMs. Notably, our analysis reveals that the tool documentation refined via our approach demonstrates robust cross-model generalization capabilities.
△ Less
Submitted 10 October, 2024;
originally announced October 2024.
-
HARIVO: Harnessing Text-to-Image Models for Video Generation
Authors:
Mingi Kwon,
Seoung Wug Oh,
Yang Zhou,
Difan Liu,
Joon-Young Lee,
Haoran Cai,
Baqiao Liu,
Feng Liu,
Youngjung Uh
Abstract:
We present a method to create diffusion-based video models from pretrained Text-to-Image (T2I) models. Recently, AnimateDiff proposed freezing the T2I model while only training temporal layers. We advance this method by proposing a unique architecture, incorporating a mapping network and frame-wise tokens, tailored for video generation while maintaining the diversity and creativity of the original…
▽ More
We present a method to create diffusion-based video models from pretrained Text-to-Image (T2I) models. Recently, AnimateDiff proposed freezing the T2I model while only training temporal layers. We advance this method by proposing a unique architecture, incorporating a mapping network and frame-wise tokens, tailored for video generation while maintaining the diversity and creativity of the original T2I model. Key innovations include novel loss functions for temporal smoothness and a mitigating gradient sampling technique, ensuring realistic and temporally consistent video generation despite limited public video data. We have successfully integrated video-specific inductive biases into the architecture and loss functions. Our method, built on the frozen StableDiffusion model, simplifies training processes and allows for seamless integration with off-the-shelf models like ControlNet and DreamBooth. project page: https://kwonminki.github.io/HARIVO
△ Less
Submitted 10 October, 2024;
originally announced October 2024.
-
Precision Measurement of the Branching Fraction of $D^{+}\to μ^{+}ν_μ$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (643 additional authors not shown)
Abstract:
Using $20.3~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of $E_{\rm cm}=3.773$ GeV with the BESIII detector operating at the BEPCII collider, we determine the branching fraction of the leptonic decay $D^+\toμ^+ν_μ$ to be $(3.981\pm0.079_{\rm stat}\pm0.040_{\rm syst})\times10^{-4}$. Interpreting our measurement with knowledge of the Fermi coupling constant…
▽ More
Using $20.3~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of $E_{\rm cm}=3.773$ GeV with the BESIII detector operating at the BEPCII collider, we determine the branching fraction of the leptonic decay $D^+\toμ^+ν_μ$ to be $(3.981\pm0.079_{\rm stat}\pm0.040_{\rm syst})\times10^{-4}$. Interpreting our measurement with knowledge of the Fermi coupling constant $G_F$, the masses of the $D^+$ and $μ^+$ as well as the lifetime of the $D^+$, we determine $f_{D^+}|V_{cd}|=(47.53\pm0.48_{\rm stat}\pm0.24_{\rm syst}\pm0.12_{\rm input})~\mathrm{MeV}$. This result is a factor of 2.3 more precise than the previous best measurement. Using the value of the magnitude of the Cabibbo-Kobayashi-Maskawa matrix element $|V_{cd}|$ given by the global standard model fit, we obtain the $D^+$ decay constant $f_{D^+}=(211.5\pm2.3_{\rm stat}\pm1.1_{\rm syst}\pm0.8_{\rm input})$ MeV. Alternatively, using the value of $f_{D^+}$ from a precise lattice quantum chromodynamics calculation, we extract $|V_{cd}|=0.2242\pm0.0023_{\rm stat}\pm0.0011_{\rm syst}\pm0.0009_{\rm input}$.
△ Less
Submitted 10 October, 2024;
originally announced October 2024.
-
Edge-guided inverse design of digital metamaterials for ultra-high-capacity on-chip multi-dimensional interconnect
Authors:
Aolong Sun,
Sizhe Xing,
Xuyu Deng,
Ruoyu Shen,
An Yan,
Fangchen Hu,
Yuqin Yuan,
Boyu Dong,
Junhao Zhao,
Ouhan Huang,
Ziwei Li,
Jianyang Shi,
Yingjun Zhou,
Chao Shen,
Yiheng Zhao,
Bingzhou Hong,
Wei Chu,
Junwen Zhang,
Haiwen Cai,
Nan Chi
Abstract:
The escalating demands of compute-intensive applications, including artificial intelligence, urgently necessitate the adoption of sophisticated optical on-chip interconnect technologies to overcome critical bottlenecks in scaling future computing systems. This transition requires leveraging the inherent parallelism of wavelength and mode dimensions of light, complemented by high-order modulation f…
▽ More
The escalating demands of compute-intensive applications, including artificial intelligence, urgently necessitate the adoption of sophisticated optical on-chip interconnect technologies to overcome critical bottlenecks in scaling future computing systems. This transition requires leveraging the inherent parallelism of wavelength and mode dimensions of light, complemented by high-order modulation formats, to significantly enhance data throughput. Here we experimentally demonstrate a novel synergy of these three dimensions, achieving multi-tens-of-terabits-per-second on-chip interconnects using ultra-broadband, multi-mode digital metamaterials. Employing a highly efficient edge-guided analog-and-digital optimization method, we inversely design foundry-compatible, robust, and multi-port digital metamaterials with an 8xhigher computational efficiency. Using a packaged five-mode multiplexing chip, we demonstrate a single-wavelength interconnect capacity of 1.62 Tbit s-1 and a record-setting multi-dimensional interconnect capacity of 38.2 Tbit s-1 across 5 modes and 88 wavelength channels. A theoretical analysis suggests that further system optimization can enable on-chip interconnects to reach sub-petabit-per-second data transmission rates. This study highlights the transformative potential of optical interconnect technologies to surmount the constraints of electronic links, thus setting the stage for next-generation datacenter and optical compute interconnects.
△ Less
Submitted 9 October, 2024;
originally announced October 2024.
-
DeepMuon: Accelerating Cosmic-Ray Muon Simulation Based on Optimal Transport
Authors:
Ao-Bo Wang,
Chu-Cheng Pan,
Xiang Dong,
Yu-Chang Sun,
Yu-Xuan Hu,
Ao-Yan Cheng,
Hao Cai,
Xi-Long Fan
Abstract:
Cosmic muon imaging technology is increasingly being applied in various fields. However, simulating cosmic muons typically requires the rapid generation of a large number of muons and tracking their complex trajectories through intricate structures. This process is highly computationally demanding and consumes significant CPU time. To address these challenges, we introduce DeepMuon, an innovative…
▽ More
Cosmic muon imaging technology is increasingly being applied in various fields. However, simulating cosmic muons typically requires the rapid generation of a large number of muons and tracking their complex trajectories through intricate structures. This process is highly computationally demanding and consumes significant CPU time. To address these challenges, we introduce DeepMuon, an innovative deep learning model designed to efficiently and accurately generate cosmic muon distributions. In our approach, we employ the inverse Box-Cox transformation to reduce the kurtosis of the muon energy distribution, making it more statistically manageable for the model to learn. Additionally, we utilize the Sliced Wasserstein Distance (SWD) as a loss function to ensure precise simulation of the high-dimensional distributions of cosmic muons. We also demonstrate that DeepMuon can accurately learn muon distribution patterns from a limited set of data, enabling it to simulate real-world cosmic muon distributions as captured by detectors. Compared to traditional tools like CRY, DeepMuon significantly increases the speed of muon generation at sea level. Furthermore, we have developed a pipeline using DeepMuon that directly simulates muon distributions in underwater environments, dramatically accelerating simulations for underwater muon radiography and tomography. For more details on our open-source project, please visit https://github.com/wangab0/deepmuon.
△ Less
Submitted 9 October, 2024; v1 submitted 9 October, 2024;
originally announced October 2024.
-
Search for the radiative decays $D^+\toγρ^+$ and $D^+\toγK^{*+}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (648 additional authors not shown)
Abstract:
We search for the radiative decays $D^{+} \to γρ^+$ and $D^{+} \to γK^{*+}$ using 20.3~fb$^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$ GeV by the BESIII detector operating at the BEPCII collider. No significant signals are observed, and the upper limits on the branching fractions of $D^{+} \to γρ^+$ and $D^{+} \to γK^{*+}$ at 90\% confidence level ar…
▽ More
We search for the radiative decays $D^{+} \to γρ^+$ and $D^{+} \to γK^{*+}$ using 20.3~fb$^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$ GeV by the BESIII detector operating at the BEPCII collider. No significant signals are observed, and the upper limits on the branching fractions of $D^{+} \to γρ^+$ and $D^{+} \to γK^{*+}$ at 90\% confidence level are set to be $1.3\times10^{-5}$ and $1.8\times10^{-5}$, respectively.
△ Less
Submitted 8 October, 2024;
originally announced October 2024.
-
Riemannian Optimization for Non-convex Euclidean Distance Geometry with Global Recovery Guarantees
Authors:
Chandler Smith,
HanQin Cai,
Abiy Tasissa
Abstract:
The problem of determining the configuration of points from partial distance information, known as the Euclidean Distance Geometry (EDG) problem, is fundamental to many tasks in the applied sciences. In this paper, we propose two algorithms grounded in the Riemannian optimization framework to address the EDG problem. Our approach formulates the problem as a low-rank matrix completion task over the…
▽ More
The problem of determining the configuration of points from partial distance information, known as the Euclidean Distance Geometry (EDG) problem, is fundamental to many tasks in the applied sciences. In this paper, we propose two algorithms grounded in the Riemannian optimization framework to address the EDG problem. Our approach formulates the problem as a low-rank matrix completion task over the Gram matrix, using partial measurements represented as expansion coefficients of the Gram matrix in a non-orthogonal basis. For the first algorithm, under a uniform sampling with replacement model for the observed distance entries, we demonstrate that, with high probability, a Riemannian gradient-like algorithm on the manifold of rank-$r$ matrices converges linearly to the true solution, given initialization via a one-step hard thresholding. This holds provided the number of samples, $m$, satisfies $m \geq \mathcal{O}(n^{7/4}r^2 \log(n))$. With a more refined initialization, achieved through resampled Riemannian gradient-like descent, we further improve this bound to $m \geq \mathcal{O}(nr^2 \log(n))$. Our analysis for the first algorithm leverages a non-self-adjoint operator and depends on deriving eigenvalue bounds for an inner product matrix of restricted basis matrices, leveraging sparsity properties for tighter guarantees than previously established. The second algorithm introduces a self-adjoint surrogate for the sampling operator. This algorithm demonstrates strong numerical performance on both synthetic and real data. Furthermore, we show that optimizing over manifolds of higher-than-rank-$r$ matrices yields superior numerical results, consistent with recent literature on overparameterization in the EDG problem.
△ Less
Submitted 8 October, 2024;
originally announced October 2024.
-
The Atacama Cosmology Telescope: Large-scale velocity reconstruction with the kinematic Sunyaev--Zel'dovich effect and DESI LRGs
Authors:
Fiona McCarthy,
Nicholas Battaglia,
Rachel Bean,
J. Richard Bond,
Hongbo Cai,
Erminia Calabrese,
William R. Coulton,
Mark J. Devlin,
Jo Dunkley,
Simone Ferraro,
Vera Gluscevic,
Yilun Guan,
J. Colin Hill,
Matthew C. Johnson,
Aleksandra Kusiak,
Alex Laguë,
Niall MacCrann,
Mathew S. Madhavacheril,
Kavilan Moodley,
Sigurd Naess,
Frank J. Qu,
Bernardita Ried Guachalla,
Neelima Sehgal,
Blake D. Sherwin,
Cristóbal Sifón
, et al. (5 additional authors not shown)
Abstract:
The kinematic Sunyaev--Zel'dovich (kSZ) effect induces a non-zero density-density-temperature bispectrum, which we can use to reconstruct the large-scale velocity field from a combination of cosmic microwave background (CMB) and galaxy density measurements, in a procedure known as ``kSZ velocity reconstruction''. This method has been forecast to constrain large-scale modes with future galaxy and C…
▽ More
The kinematic Sunyaev--Zel'dovich (kSZ) effect induces a non-zero density-density-temperature bispectrum, which we can use to reconstruct the large-scale velocity field from a combination of cosmic microwave background (CMB) and galaxy density measurements, in a procedure known as ``kSZ velocity reconstruction''. This method has been forecast to constrain large-scale modes with future galaxy and CMB surveys, improving their measurement beyond what is possible with the galaxy surveys alone. Such measurements will enable tighter constraints on large-scale signals such as primordial non-Gaussianity, deviations from homogeneity, and modified gravity. In this work, we demonstrate a statistically significant measurement of kSZ velocity reconstruction for the first time, by applying quadratic estimators to the combination of the ACT DR6 CMB+kSZ map and the DESI LRG galaxies (with photometric redshifts) in order to reconstruct the velocity field. We do so using a formalism appropriate for the 2-dimensional projected galaxy fields that we use, which naturally incorporates the curved-sky effects important on the largest scales. We find evidence for the signal by cross-correlating with an external estimate of the velocity field from the spectroscopic BOSS survey and rejecting the null (no-kSZ) hypothesis at $3.8σ$. Our work presents a first step towards the use of this observable for cosmological analyses.
△ Less
Submitted 8 October, 2024;
originally announced October 2024.
-
Observation of an axial-vector state in the study of $ψ(3686) \to φηη'$ decay
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (625 additional authors not shown)
Abstract:
Using (2712.4 $\pm$ 14.3)$\times 10^{6}$ $ψ(3686)$ events collected with the BESIII detector at BEPCII, a partial wave analysis of the decay $ψ(3686) \to φηη' $ is performed with the covariant tensor approach. An axial-vector state with a mass near 2.3 $\rm GeV/c^2$ is observed for the first time. Its mass and width are measured to be 2316…
▽ More
Using (2712.4 $\pm$ 14.3)$\times 10^{6}$ $ψ(3686)$ events collected with the BESIII detector at BEPCII, a partial wave analysis of the decay $ψ(3686) \to φηη' $ is performed with the covariant tensor approach. An axial-vector state with a mass near 2.3 $\rm GeV/c^2$ is observed for the first time. Its mass and width are measured to be 2316 $\pm 9_{\mathrm{stat}} \pm 30_{\mathrm{syst}}\,\rm MeV/c^2$ and 89 $\pm 15_{\mathrm{stat}} \pm 26_{\mathrm{syst}}\,\rm MeV$, respectively. The product branching fractions of $\mathcal{B}(ψ(3686) \to X(2300) η') \mathcal{B}(X(2300)\to φη)$ and $\mathcal{B}(ψ(3686) \to X(2300) η)\mathcal{B}(X(2300)\to φη')$ are determined to be (4.8 $\pm 1.3_{\mathrm{stat}} \pm 0.7_{\mathrm{syst}})\times 10^{-6}$ and (2.2 $\pm 0.7_{\mathrm{stat}} \pm 0.7_{\mathrm{syst}})\times 10^{-6}$, respectively. The branching fraction $\mathcal{B}(ψ(3686) \to φηη')$ is measured for the first time to be (3.14$\pm0.17_{\mathrm{stat}}\pm0.24_{\mathrm{syst}})\times10^{-5}$.
The first uncertainties are statistical and the second are systematic.
△ Less
Submitted 8 October, 2024;
originally announced October 2024.
-
Enhancing Android Malware Detection: The Influence of ChatGPT on Decision-centric Task
Authors:
Yao Li,
Sen Fang,
Tao Zhang,
Haipeng Cai
Abstract:
With the rise of large language models, such as ChatGPT, non-decisional models have been applied to various tasks. Moreover, ChatGPT has drawn attention to the traditional decision-centric task of Android malware detection. Despite effective detection methods proposed by scholars, they face low interpretability issues. Specifically, while these methods excel in classifying applications as benign o…
▽ More
With the rise of large language models, such as ChatGPT, non-decisional models have been applied to various tasks. Moreover, ChatGPT has drawn attention to the traditional decision-centric task of Android malware detection. Despite effective detection methods proposed by scholars, they face low interpretability issues. Specifically, while these methods excel in classifying applications as benign or malicious and can detect malicious behavior, they often fail to provide detailed explanations for the decisions they make. This challenge raises concerns about the reliability of existing detection schemes and questions their true ability to understand complex data. In this study, we investigate the influence of the non-decisional model, ChatGPT, on the traditional decision-centric task of Android malware detection. We choose three state-of-the-art solutions, Drebin, XMAL, and MaMaDroid, conduct a series of experiments on publicly available datasets, and carry out a comprehensive comparison and analysis. Our findings indicate that these decision-driven solutions primarily rely on statistical patterns within datasets to make decisions, rather than genuinely understanding the underlying data. In contrast, ChatGPT, as a non-decisional model, excels in providing comprehensive analysis reports, substantially enhancing interpretability. Furthermore, we conduct surveys among experienced developers. The result highlights developers' preference for ChatGPT, as it offers in-depth insights and enhances efficiency and understanding of challenges. Meanwhile, these studies and analyses offer profound insights, presenting developers with a novel perspective on Android malware detection--enhancing the reliability of detection results from a non-decisional perspective.
△ Less
Submitted 6 October, 2024;
originally announced October 2024.
-
Flash-Splat: 3D Reflection Removal with Flash Cues and Gaussian Splats
Authors:
Mingyang Xie,
Haoming Cai,
Sachin Shah,
Yiran Xu,
Brandon Y. Feng,
Jia-Bin Huang,
Christopher A. Metzler
Abstract:
We introduce a simple yet effective approach for separating transmitted and reflected light. Our key insight is that the powerful novel view synthesis capabilities provided by modern inverse rendering methods (e.g.,~3D Gaussian splatting) allow one to perform flash/no-flash reflection separation using unpaired measurements -- this relaxation dramatically simplifies image acquisition over conventio…
▽ More
We introduce a simple yet effective approach for separating transmitted and reflected light. Our key insight is that the powerful novel view synthesis capabilities provided by modern inverse rendering methods (e.g.,~3D Gaussian splatting) allow one to perform flash/no-flash reflection separation using unpaired measurements -- this relaxation dramatically simplifies image acquisition over conventional paired flash/no-flash reflection separation methods. Through extensive real-world experiments, we demonstrate our method, Flash-Splat, accurately reconstructs both transmitted and reflected scenes in 3D. Our method outperforms existing 3D reflection separation methods, which do not leverage illumination control, by a large margin. Our project webpage is at https://flash-splat.github.io/.
△ Less
Submitted 3 October, 2024;
originally announced October 2024.
-
Measurement of the effective leptonic weak mixing angle
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1117 additional authors not shown)
Abstract:
Using $pp$ collision data at $\sqrt{s}=13$ TeV, recorded by the LHCb experiment between 2016 and 2018 and corresponding to an integrated luminosity of $5.4$ fb$^{-1}$, the forward-backward asymmetry in the $pp \to Z/γ^{*} \to μ^+μ^-$ process is measured. The measurement is carried out in ten intervals of the difference between the muon pseudorapidities, within a fiducial region covering dimuon mas…
▽ More
Using $pp$ collision data at $\sqrt{s}=13$ TeV, recorded by the LHCb experiment between 2016 and 2018 and corresponding to an integrated luminosity of $5.4$ fb$^{-1}$, the forward-backward asymmetry in the $pp \to Z/γ^{*} \to μ^+μ^-$ process is measured. The measurement is carried out in ten intervals of the difference between the muon pseudorapidities, within a fiducial region covering dimuon masses between $66$ and $116$ GeV, muon pseudorapidities between $2.0$ and $4.5$ and muon transverse momenta above $20$ GeV. These forward-backward asymmetries are compared with predictions, at next-to-leading order in the strong and electroweak couplings. The measured effective leptonic weak mixing angle is $\sin^2θ_{\rm eff}^\ell = 0.23147 \pm 0.00044 \pm 0.00005 \pm 0.00023$, where the first uncertainty is statistical, the second arises from systematic uncertainties associated with the asymmetry measurement, and the third arises from uncertainties in the fit model used to extract $\sin^2θ_{\rm eff}^\ell$ from the asymmetry measurement. This result is based on an arithmetic average of results using the CT18, MSHT20, and NNPDF31 parameterisations of the proton internal structure, and is consistent with previous measurements and with predictions from the global electroweak fit.
△ Less
Submitted 3 October, 2024;
originally announced October 2024.
-
Search for lepton number violating decays of $D_s^+\to h^-h^0e^+e^+$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (650 additional authors not shown)
Abstract:
Based on 7.33 fb$^{-1}$ of $e^+e^-$ collision data collected by the BESIII detector operating at the BEPCII collider at center-of-mass energies from 4.128 to 4.226 GeV, a search for the Majorana neutrino $ν_m$ is conducted in the lepton-number-violating decays of $D_s^+\to h^-h^0e^+e^+$. Here, $h^-$ represents a $K^-$ or $π^-$, and $h^0$ represents a $π^0$, $K_S^0$ or $φ$. No significant signal is…
▽ More
Based on 7.33 fb$^{-1}$ of $e^+e^-$ collision data collected by the BESIII detector operating at the BEPCII collider at center-of-mass energies from 4.128 to 4.226 GeV, a search for the Majorana neutrino $ν_m$ is conducted in the lepton-number-violating decays of $D_s^+\to h^-h^0e^+e^+$. Here, $h^-$ represents a $K^-$ or $π^-$, and $h^0$ represents a $π^0$, $K_S^0$ or $φ$. No significant signal is observed, and the upper limits of their branching fractions at the 90\% confidence level are determined to be $\mathcal{B}(D_s^+\to φπ^-e^+e^+) < 6.9 \times 10^{-5}$, $\mathcal{B}(D_s^+\to φK^-e^+e^+) < 9.9 \times 10^{-5}$, $\mathcal{B}(D_s^+\to K_S^0π^-e^+e^+) < 1.3 \times 10^{-5}$, $\mathcal{B}(D_s^+\to K_S^0K^-e^+e^+) < 2.9 \times 10^{-5}$, $\mathcal{B}(D_s^+\to π^-π^0e^+e^+) < 2.9 \times 10^{-5}$ and $\mathcal{B}(D_s^+\to K^-π^0e^+e^+) < 3.4 \times 10^{-5}$. The Majorana neutrino is searched for with different mass assumptions within the range [0.20, 0.80] GeV$/c^2$ in the decay of $D_s^+\toφe^+ν_m$ with $ν_m\toπ^-e^+$, and the upper limits of the branching fractions at the 90\% confidence level are at the level of $10^{-5}-10^{-2}$, depending on the mass of the Majorana neutrino.
△ Less
Submitted 3 October, 2024;
originally announced October 2024.
-
Zak Phase Induced Topological Nonreciprocity
Authors:
Xiao Liu,
Jiefei Wang,
Ruosong Mao,
Huizhu Hu,
Shi-Yao Zhu,
Xingqi Xu,
Han Cai,
Da-Wei Wang
Abstract:
Topological physics provides novel insights for designing functional photonic devices, such as magnetic-free optical diodes, which are important in optical engineering and quantum information processing. Past efforts mostly focus on the topological edge modes in two-dimensional (2D) photonic Chern lattices, which, however, require delicate fabrication and temporal modulation. In particular, the 1D…
▽ More
Topological physics provides novel insights for designing functional photonic devices, such as magnetic-free optical diodes, which are important in optical engineering and quantum information processing. Past efforts mostly focus on the topological edge modes in two-dimensional (2D) photonic Chern lattices, which, however, require delicate fabrication and temporal modulation. In particular, the 1D nonreciprocal edge mode needs to be embedded in a 2D lattice, contradicting with the compactness of integrated photonics. To address these challenges, we investigate the optical nonreciprocity of the 1D Su-Schrieffer-Heeger (SSH) superradiance lattices in room-temperature atoms. The probe fields propagating in two opposite directions perceive two different SSH topological phases, which have different absorption spectra due to the interplay between the Zak phase and the thermal motion of atoms, resulting in optical nonreciprocity. Our findings reveal the relationship between 1D topological matter and optical nonreciprocity, simplifying the design of topologically resilient nonreciprocal devices.
△ Less
Submitted 26 September, 2024;
originally announced September 2024.
-
VibraForge: A Scalable Prototyping Toolkit For Creating Spatialized Vibrotactile Feedback Systems
Authors:
Bingjian Huang,
Siyi Ren,
Yuewen Luo,
Qilong Cheng,
Hanfeng Cai,
Yeqi Sang,
Mauricio Sousa,
Paul H. Dietz,
Daniel Wigdor
Abstract:
Spatialized vibrotactile feedback systems deliver tactile information by placing multiple vibrotactile actuators on the body. As increasing numbers of actuators are required to adequately convey information in complicated applications, haptic designers find it difficult to create such systems due to limited scalability of existing toolkits. We propose VibraForge, an open-source vibrotactile toolki…
▽ More
Spatialized vibrotactile feedback systems deliver tactile information by placing multiple vibrotactile actuators on the body. As increasing numbers of actuators are required to adequately convey information in complicated applications, haptic designers find it difficult to create such systems due to limited scalability of existing toolkits. We propose VibraForge, an open-source vibrotactile toolkit that supports up to 128 vibrotactile actuators. Each actuator is encapsulated within a self-contained vibration unit and driven by its own microcontroller. By leveraging a chain-connection method, each unit receives independent vibration commands from a control unit, with fine-grained control over intensity and frequency. We also designed a GUI Editor to expedite the authoring of spatial vibrotactile patterns. Technical evaluations show that vibration units reliably reproduce audio waveforms with low-latency and high-bandwidth data communication. Case studies of phonemic tactile display, virtual reality fitness training, and drone teleoperation demonstrate the potential usage of VibraForge within different domains.
△ Less
Submitted 25 September, 2024;
originally announced September 2024.
-
Search for $B_{(s)}^{*0}\toμ^+μ^-$ in $B_c^+\toπ^+μ^+μ^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1113 additional authors not shown)
Abstract:
A search for the very rare $B^{*0}\toμ^+μ^-$ and $B_{s}^{*0}\toμ^+μ^-$ decays is conducted by analysing the $B_c^+\to π^+μ^+μ^-$ process. The analysis uses proton-proton collision data collected with the LHCb detector between 2011 and 2018, corresponding to an integrated luminosity of 9$\text{\,fb}^{-1}$. The signal signatures correspond to simultaneous peaks in the $μ^+μ^-$ and $π^+μ^+μ^-$ invari…
▽ More
A search for the very rare $B^{*0}\toμ^+μ^-$ and $B_{s}^{*0}\toμ^+μ^-$ decays is conducted by analysing the $B_c^+\to π^+μ^+μ^-$ process. The analysis uses proton-proton collision data collected with the LHCb detector between 2011 and 2018, corresponding to an integrated luminosity of 9$\text{\,fb}^{-1}$. The signal signatures correspond to simultaneous peaks in the $μ^+μ^-$ and $π^+μ^+μ^-$ invariant masses. No evidence for an excess of events over background is observed for either signal decay mode. Upper limits at the $90\%$ confidence level are set on the branching fractions relative to that for $B_c^+\to J\mskip -3mu/\mskip -2muψπ^+$ decays, \begin{align*}
{\cal R}_{B^{*0}(μ^+μ^-)π^+/J\mskip -3mu/\mskip -2muψπ^+} &< 3.8\times 10^{-5}\ \text{ and }
{\cal R}_{B_{s}^{*0}(μ^+μ^-)π^+/J\mskip -3mu/\mskip -2muψπ^+} &< 5.0\times 10^{-5}\,. \end{align*}
△ Less
Submitted 25 September, 2024;
originally announced September 2024.
-
Search for $D^0\to K^-ηe^+ν_e$, $D^+\to K_S^0 ηe^+ν_e$ and $D^+\to ηηe^+ν_e$ decays
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (634 additional authors not shown)
Abstract:
By analyzing $e^+e^-$ annihilation data corresponding to an integrated luminosity of 7.93 fb$^{-1}$, collected at the center-of-mass energy of 3.773 GeV with the BESIII detector, we search for the semileptonic decays $D^0\to K^-ηe^+ν_e$, $D^+\to K_S^0 ηe^+ν_e$ and $D^+\to ηηe^+ν_e$ for the first time. We present evidence for $D^0\to K^-ηe^+ν_e$ with a significance of $3.3σ$. The branching fraction…
▽ More
By analyzing $e^+e^-$ annihilation data corresponding to an integrated luminosity of 7.93 fb$^{-1}$, collected at the center-of-mass energy of 3.773 GeV with the BESIII detector, we search for the semileptonic decays $D^0\to K^-ηe^+ν_e$, $D^+\to K_S^0 ηe^+ν_e$ and $D^+\to ηηe^+ν_e$ for the first time. We present evidence for $D^0\to K^-ηe^+ν_e$ with a significance of $3.3σ$. The branching fraction of $D^0\to K^-ηe^+ν_e$ is measured to be $(0.84_{-0.34}^{+0.29}\pm0.22)\times 10^{-4}$. Here, the first uncertainties are statistical and the second ones are systematic. No significant signals are observed for the decays $D^+\to K_S^0 ηe^+ν_e$ and $D^+\to ηηe^+ν_e$ and we set the upper limits on their branching fractions.
△ Less
Submitted 24 September, 2024; v1 submitted 23 September, 2024;
originally announced September 2024.
-
Analysis of $\itΛ^\mathrm{0}_b \rightarrow pK^-μ^+μ^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1114 additional authors not shown)
Abstract:
The differential branching fraction and angular coefficients of \ensuremath{\itΛ^\mathrm{0}_b \rightarrow pK^-μ^+μ^-}\xspace decays are measured in bins of the dimuon mass squared and dihadron mass. The analysis is performed using a data set corresponding to 9$\aunit{fb}^{-1}$ of integrated luminosity collected with the $\mbox{LHCb}$ detector between 2011 and 2018. The data are consistent with rec…
▽ More
The differential branching fraction and angular coefficients of \ensuremath{\itΛ^\mathrm{0}_b \rightarrow pK^-μ^+μ^-}\xspace decays are measured in bins of the dimuon mass squared and dihadron mass. The analysis is performed using a data set corresponding to 9$\aunit{fb}^{-1}$ of integrated luminosity collected with the $\mbox{LHCb}$ detector between 2011 and 2018. The data are consistent with receiving contributions from a mixture of $\itΛ$ resonances with different spin-parity quantum numbers. The angular coefficients show a pattern of vector--axial vector interference that is a characteristic of the type of flavour-changing neutral-current transition relevant for these decays.
△ Less
Submitted 19 September, 2024;
originally announced September 2024.
-
Ideal flat and resolved SU(3) Landau levels in three dimensions
Authors:
Mian Peng,
Qiang Wei,
Jiale Yuan,
Da-Wei Wang,
Mou Yan,
Han Cai,
Gang Chen
Abstract:
Landau levels (LLs) are of great importance for understanding the quantum Hall effect and associated many-body physics. Recently, their three-dimensional (3D) counterparts, i.e., dispersionless 3D LLs with well-defined quantum numbers, have attracted significant attention but have not yet been reported. Here we theoretically propose and experimentally observe 3D LLs with a sharply quantized spectr…
▽ More
Landau levels (LLs) are of great importance for understanding the quantum Hall effect and associated many-body physics. Recently, their three-dimensional (3D) counterparts, i.e., dispersionless 3D LLs with well-defined quantum numbers, have attracted significant attention but have not yet been reported. Here we theoretically propose and experimentally observe 3D LLs with a sharply quantized spectrum in a diamond acoustic lattice, where the eigenstates are characterized by SU(3) quantum numbers. The engineered inhomogeneous hopping strengths not only introduce pseudomagnetic fields that quantize the nodal lines into LLs but also provide three bosonic degrees of freedom, embedding a generic SU(3) symmetry into the LLs. Using a phased array of acoustic sources, we selectively excite distinct eigenstates within the degenerate LL multiplets and visualize their 3D eigenmodes. Importantly, our approach enables the precise reconstruction of SU(3) quantum numbers directly from eigenmode correlations. Our results establish SU(3) LLs as a tractable model in artificial platforms, and pave the way for synthesizing LLs with zero dispersion and countable quantum numbers in arbitrary dimensions.
△ Less
Submitted 16 September, 2024;
originally announced September 2024.
-
Realizing tunable Fermi level in SnTe by defect control
Authors:
Bamidele Oluwagbenga Onipede,
Matthew Metcalf,
Nisha Fletcher,
Hui Cai
Abstract:
The tuning of the Fermi level in tin telluride, a topological crystalline insulator, is essential for accessing its unique surface states and optimizing its electronic properties for applications such as spintronics and quantum computing. In this study, we demonstrate that the Fermi level in tin telluride can be effectively modulated by controlling the tin concentration during chemical vapor depos…
▽ More
The tuning of the Fermi level in tin telluride, a topological crystalline insulator, is essential for accessing its unique surface states and optimizing its electronic properties for applications such as spintronics and quantum computing. In this study, we demonstrate that the Fermi level in tin telluride can be effectively modulated by controlling the tin concentration during chemical vapor deposition synthesis. By introducing tin-rich conditions, we observed a blue shift in the X-ray photoelectron spectroscopy core-level peaks of both tin and tellurium, indicating an upward shift in the Fermi level. This shift is corroborated by a decrease in work function values measured via ultraviolet photoelectron spectroscopy, confirming the suppression of Sn vacancies. Our findings provide a low-cost, scalable method to achieve tunable Fermi levels in tin telluride, offering a significant advancement in the development of materials with tailored electronic properties for next-generation technological applications.
△ Less
Submitted 12 September, 2024;
originally announced September 2024.
-
A 2D T-carbon 2-(111) structure with tunable electric and optical properties via chemical decorations: a first-principles investigation
Authors:
Haifang Cai,
Zhiwen Duan,
Douglas S. Galvao,
Kun Cai
Abstract:
We proposed a new two-dimensional carbon material named 2-(111) planar T-carbon, which is obtained by slicing bulk T-carbon along its (111) crystallographic direction. 2-(111) planar T-carbon's optical and electrical properties can be engineered via surface decoration. Comparing the DFT phonon spectra of pristine and five decorated 2-(111) planar T-carbon obtained by first-principles calculations,…
▽ More
We proposed a new two-dimensional carbon material named 2-(111) planar T-carbon, which is obtained by slicing bulk T-carbon along its (111) crystallographic direction. 2-(111) planar T-carbon's optical and electrical properties can be engineered via surface decoration. Comparing the DFT phonon spectra of pristine and five decorated 2-(111) planar T-carbon obtained by first-principles calculations, we conclude that surface decoration presents a promising, effective, and feasible strategy to improve the structural stability of 2-(111) planar T-carbon. The calculated band structures and electronic properties show direct electronic band gap values between 0.17 eV (-O= decorated) and 2.21 eV (Hydrogenated). Chemical decoration also promises blue or red energy shifts in its optical properties.
△ Less
Submitted 11 September, 2024;
originally announced September 2024.
-
Measurements of the $CP$-even fractions of $D^0\toπ^{+}π^{-}π^{0}$ and $D^0\to K^{+}K^{-}π^{0}$ at BESIII
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (648 additional authors not shown)
Abstract:
The $CP$-even fractions ($F_{+}$) of the decays $D^0\toπ^{+}π^{-}π^{0}$ and $D^0\to K^{+}K^{-}π^{0}$ are measured with a quantum-correlated $ψ(3770)\to D\bar{D}$ data sample collected by the BESIII experiment corresponding to an integrated luminosity of 7.93 $\mathrm{fb}^{-1}$. The results are $F_{+}^{π^{+}π^{-}π^{0}}=0.9406\pm0.0036\pm0.0021$ and $F_{+}^{K^{+}K^{-}π^{0}}=0.631\pm0.014\pm0.011$, w…
▽ More
The $CP$-even fractions ($F_{+}$) of the decays $D^0\toπ^{+}π^{-}π^{0}$ and $D^0\to K^{+}K^{-}π^{0}$ are measured with a quantum-correlated $ψ(3770)\to D\bar{D}$ data sample collected by the BESIII experiment corresponding to an integrated luminosity of 7.93 $\mathrm{fb}^{-1}$. The results are $F_{+}^{π^{+}π^{-}π^{0}}=0.9406\pm0.0036\pm0.0021$ and $F_{+}^{K^{+}K^{-}π^{0}}=0.631\pm0.014\pm0.011$, where the first uncertainties are statistical and the second systematic. These measurements are consistent with the previous determinations, and the uncertainties for $F_{+}^{π^{+}π^{-}π^{0}}$ and $F_{+}^{K^{+}K^{-}π^{0}}$ are reduced by factors of 3.9 and 2.6, respectively. The reported results provide important inputs for the precise measurement of the angle $γ$ of the Cabibbo-Kobayashi-Maskawa matrix and indirect $CP$ violation in charm mixing.
△ Less
Submitted 11 September, 2024;
originally announced September 2024.
-
AgileIR: Memory-Efficient Group Shifted Windows Attention for Agile Image Restoration
Authors:
Hongyi Cai,
Mohammad Mahdinur Rahman,
Mohammad Shahid Akhtar,
Jie Li,
Jingyu Wu,
Zhili Fang
Abstract:
Image Transformers show a magnificent success in Image Restoration tasks. Nevertheless, most of transformer-based models are strictly bounded by exorbitant memory occupancy. Our goal is to reduce the memory consumption of Swin Transformer and at the same time speed up the model during training process. Thus, we introduce AgileIR, group shifted attention mechanism along with window attention, which…
▽ More
Image Transformers show a magnificent success in Image Restoration tasks. Nevertheless, most of transformer-based models are strictly bounded by exorbitant memory occupancy. Our goal is to reduce the memory consumption of Swin Transformer and at the same time speed up the model during training process. Thus, we introduce AgileIR, group shifted attention mechanism along with window attention, which sparsely simplifies the model in architecture. We propose Group Shifted Window Attention (GSWA) to decompose Shift Window Multi-head Self Attention (SW-MSA) and Window Multi-head Self Attention (W-MSA) into groups across their attention heads, contributing to shrinking memory usage in back propagation. In addition to that, we keep shifted window masking and its shifted learnable biases during training, in order to induce the model interacting across windows within the channel. We also re-allocate projection parameters to accelerate attention matrix calculation, which we found a negligible decrease in performance. As a result of experiment, compared with our baseline SwinIR and other efficient quantization models, AgileIR keeps the performance still at 32.20 dB on Set5 evaluation dataset, exceeding other methods with tailor-made efficient methods and saves over 50% memory while a large batch size is employed.
△ Less
Submitted 10 September, 2024;
originally announced September 2024.
-
First determination of the spin-parity of $Ξ_{c}(3055)^{+,0}$ baryons
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1109 additional authors not shown)
Abstract:
The ${Ξ_{b}^{0(-)}\toΞ_{c}(3055)^{+(0)}(\to D^{+(0)}Λ)π^{-}}$ decay chains are observed, and the spin-parity of $Ξ_{c}(3055)^{+(0)}$ baryons is determined for the first time. The measurement is performed using proton-proton collision data at a center-of-mass energy of $\sqrt{s}=13\,\text{TeV}$, corresponding to an integrated luminosity of $5.4\,\text{fb}^{-1}$, recorded by the~$\text{LHCb}$ experi…
▽ More
The ${Ξ_{b}^{0(-)}\toΞ_{c}(3055)^{+(0)}(\to D^{+(0)}Λ)π^{-}}$ decay chains are observed, and the spin-parity of $Ξ_{c}(3055)^{+(0)}$ baryons is determined for the first time. The measurement is performed using proton-proton collision data at a center-of-mass energy of $\sqrt{s}=13\,\text{TeV}$, corresponding to an integrated luminosity of $5.4\,\text{fb}^{-1}$, recorded by the~$\text{LHCb}$ experiment between 2016 and 2018. The spin-parity of the $Ξ_{c}(3055)^{+(0)}$ baryons is determined to be $3/2^{+}$ with a significance of more than $6.5σ$ ($3.5σ$) compared to all other tested hypotheses. The up-down asymmetries of the ${Ξ_{b}^{0(-)}\toΞ_{c}(3055)^{+(0)}π^{-}}$ transitions are measured to be $-0.92\pm0.10\pm0.05$ ($-0.92\pm0.16\pm0.22$), consistent with maximal parity violation, where the first uncertainty is statistical and the second is systematic. These results support the hypothesis that the $Ξ_{c}(3055)^{+(0)}$ baryons correspond to the first $D$-wave $λ$-mode excitation of the $Ξ_{c}$ flavor triplet.
△ Less
Submitted 9 September, 2024;
originally announced September 2024.
-
Study of the decay $D^0\rightarrow ρ(770)^-e^+ν_e$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (646 additional authors not shown)
Abstract:
We present a study of the semileptonic decay $D^0\rightarrow π^-π^0e^{+}ν_{e}$ using an $e^+e^-$ annihilation data sample of $7.93~\mathrm{fb}^{-1}$ collected at the center-of-mass energy of 3.773 GeV with the BESIII detector. The branching fraction of $D^0\to ρ(770)^-e^+ν_e$ is measured to be $(1.439 \pm 0.033(\rm stat.) \pm 0.027(\rm syst.)) \times10^{-3}$, which is a factor 1.6 more precise tha…
▽ More
We present a study of the semileptonic decay $D^0\rightarrow π^-π^0e^{+}ν_{e}$ using an $e^+e^-$ annihilation data sample of $7.93~\mathrm{fb}^{-1}$ collected at the center-of-mass energy of 3.773 GeV with the BESIII detector. The branching fraction of $D^0\to ρ(770)^-e^+ν_e$ is measured to be $(1.439 \pm 0.033(\rm stat.) \pm 0.027(\rm syst.)) \times10^{-3}$, which is a factor 1.6 more precise than previous measurements. By performing an amplitude analysis, we measure the hadronic form-factor ratios of $D^0\to ρ(770)^-e^+ν_e$ at $q^2=0$ assuming the single-pole-dominance parametrization: $r_{V}=V(0)/A_1(0)=1.548\pm0.079(\rm stat.)\pm0.041(\rm syst.)$ and $r_{2}=A_2(0)/A_1(0)=0.823\pm0.056(\rm stat.)\pm0.026(\rm syst.)$.
△ Less
Submitted 6 September, 2024;
originally announced September 2024.
-
Accelerate Neural Subspace-Based Reduced-Order Solver of Deformable Simulation by Lipschitz Optimization
Authors:
Aoran Lyu,
Shixian Zhao,
Chuhua Xian,
Zhihao Cen,
Hongmin Cai,
Guoxin Fang
Abstract:
Reduced-order simulation is an emerging method for accelerating physical simulations with high DOFs, and recently developed neural-network-based methods with nonlinear subspaces have been proven effective in diverse applications as more concise subspaces can be detected. However, the complexity and landscape of simulation objectives within the subspace have not been optimized, which leaves room fo…
▽ More
Reduced-order simulation is an emerging method for accelerating physical simulations with high DOFs, and recently developed neural-network-based methods with nonlinear subspaces have been proven effective in diverse applications as more concise subspaces can be detected. However, the complexity and landscape of simulation objectives within the subspace have not been optimized, which leaves room for enhancement of the convergence speed. This work focuses on this point by proposing a general method for finding optimized subspace mappings, enabling further acceleration of neural reduced-order simulations while capturing comprehensive representations of the configuration manifolds. We achieve this by optimizing the Lipschitz energy of the elasticity term in the simulation objective, and incorporating the cubature approximation into the training process to manage the high memory and time demands associated with optimizing the newly introduced energy. Our method is versatile and applicable to both supervised and unsupervised settings for optimizing the parameterizations of the configuration manifolds. We demonstrate the effectiveness of our approach through general cases in both quasi-static and dynamics simulations. Our method achieves acceleration factors of up to 6.83 while consistently preserving comparable simulation accuracy in various cases, including large twisting, bending, and rotational deformations with collision handling. This novel approach offers significant potential for accelerating physical simulations, and can be a good add-on to existing neural-network-based solutions in modeling complex deformable objects.
△ Less
Submitted 5 September, 2024;
originally announced September 2024.
-
Measurement of exclusive $J/ψ$ and $ψ(2S)$ production at $\sqrt{s}=13$ TeV
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1072 additional authors not shown)
Abstract:
Measurements are presented of the cross-section for the central exclusive production of $J/ψ\toμ^+μ^-$ and $ψ(2S)\toμ^+μ^-$ processes in proton-proton collisions at $\sqrt{s} = 13 $ TeV with 2016-2018 data. They are performed by requiring both muons to be in the LHCb acceptance (with pseudorapidity $2<η_{μ^\pm} < 4.5$) and mesons in the rapidity range $2.0 < y < 4.5$. The integrated cross-section…
▽ More
Measurements are presented of the cross-section for the central exclusive production of $J/ψ\toμ^+μ^-$ and $ψ(2S)\toμ^+μ^-$ processes in proton-proton collisions at $\sqrt{s} = 13 $ TeV with 2016-2018 data. They are performed by requiring both muons to be in the LHCb acceptance (with pseudorapidity $2<η_{μ^\pm} < 4.5$) and mesons in the rapidity range $2.0 < y < 4.5$. The integrated cross-section results are \begin{equation*}
σ_{J/ψ\toμ^+μ^-}(2.0<y_{J/ψ}<4.5,2.0<η_{μ^\pm} < 4.5) = 400 \pm 2 \pm 5 \pm 12 \,{\rm pb}\,,
\end{equation*} \begin{equation*}
σ_{ψ(2S)\toμ^+μ^-}(2.0<y_{ψ(2S)}<4.5,2.0<η_{μ^\pm} < 4.5) = 9.40 \pm 0.15 \pm 0.13 \pm 0.27 \,{\rm pb}\,, \end{equation*} where the uncertainties are statistical, systematic and due to the luminosity determination. In addition, a measurement of the ratio of $ψ(2S)$ and $J/ψ$ cross-sections, at an average photon-proton centre-of-mass energy of 1 TeV, is performed, giving \begin{equation*}
\frac{σ_{ψ(2S)}}{σ_{J/ψ}} = 0.1763 \pm 0.0029 \pm 0.0008 \pm 0.0039 \,, \end{equation*} where the first uncertainty is statistical, the second systematic and the third due to the knowledge of the involved branching fractions. For the first time, the dependence of the $J/ψ$ and $ψ(2S)$ cross-sections on the total transverse momentum transfer is determined in $pp$ collisions and is found consistent with the behaviour observed in electron-proton collisions.
△ Less
Submitted 11 September, 2024; v1 submitted 5 September, 2024;
originally announced September 2024.
-
Measurement of $CP$ violation in ${B^0}\rightarrow{D^{+}D^{-}}$ and ${B^{0}_{s}}\rightarrow{D^{+}_{s}D^{-}_{s}}$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1115 additional authors not shown)
Abstract:
A time-dependent, flavour-tagged measurement of $CP$ violation is performed with ${B^0}\rightarrow{D^{+}D^{-}}$ and ${B^{0}_{s}}\rightarrow{D^{+}_{s}D^{-}_{s}}$ decays, using data collected by the LHCb detector in proton-proton collisions at a centre-of-mass energy of 13 TeV corresponding to an integrated luminosity of 6 fb$^{-1}$. In ${B^0}\rightarrow{D^{+}D^{-}}$ decays the $CP$-violation parame…
▽ More
A time-dependent, flavour-tagged measurement of $CP$ violation is performed with ${B^0}\rightarrow{D^{+}D^{-}}$ and ${B^{0}_{s}}\rightarrow{D^{+}_{s}D^{-}_{s}}$ decays, using data collected by the LHCb detector in proton-proton collisions at a centre-of-mass energy of 13 TeV corresponding to an integrated luminosity of 6 fb$^{-1}$. In ${B^0}\rightarrow{D^{+}D^{-}}$ decays the $CP$-violation parameters are measured to be \begin{align}
S_{D^{+}D^{-}} & = -0.552 \pm 0.100\,\text{(stat)} \pm 0.010\,\text{(syst)}, \nonumber \newline
C_{D^{+}D^{-}} & = \phantom{-}0.128 \pm0.103\,\text{(stat)} \pm 0.010\,\text{(syst)}. \nonumber \end{align} In $B^{0}_{s} \rightarrow D^{+}_{s}D^{-}_{s}$ decays the $CP$-violating parameter formulation in terms of $φ_{s}$ and $|λ|$ results in \begin{align}
φ_{s} & = -0.086 \pm 0.106 \,\text{(stat)} \pm 0.028\,\text{(syst)} \,\text{rad}, \nonumber \newline
|λ_{D^{+}_{s}D^{-}_{s}}| & = \phantom{-}1.145 \pm 0.126\,\text{(stat)} \pm 0.031\,\text{(syst)}. \nonumber \end{align} These results represent the most precise single measurement of the $CP$-violation parameters in their respective channels. For the first time in a single measurement, $CP$ symmetry is observed to be violated in ${B^0}\rightarrow{D^{+}D^{-}}$ decays with a significance exceeding six standard deviations.
△ Less
Submitted 4 September, 2024;
originally announced September 2024.
-
Measurement of $\itΛ_\it{b}^0$, $\itΛ_\it{c}^+$ and $\itΛ$ decay parameters using $\itΛ_\it{b}^0 \to \itΛ_\it{c}^+ h^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1103 additional authors not shown)
Abstract:
A comprehensive study of the angular distributions in the bottom-baryon decays $\itΛ^\mathrm{0}_b\to\itΛ_c^+ h^-(h=π, K)$, followed by $\itΛ_c^+\to\itΛ h^+$ with $\itΛ\to \it{p} π^-$ or $\itΛ_c^+\to\it{p}\it{K}^0_\mathrm{S}$ decays, is performed using a data sample of proton-proton collisions corresponding to an integrated luminosity of $9~\mathrm{fb}^{-1}$ collected by the LHCb experiment at cent…
▽ More
A comprehensive study of the angular distributions in the bottom-baryon decays $\itΛ^\mathrm{0}_b\to\itΛ_c^+ h^-(h=π, K)$, followed by $\itΛ_c^+\to\itΛ h^+$ with $\itΛ\to \it{p} π^-$ or $\itΛ_c^+\to\it{p}\it{K}^0_\mathrm{S}$ decays, is performed using a data sample of proton-proton collisions corresponding to an integrated luminosity of $9~\mathrm{fb}^{-1}$ collected by the LHCb experiment at center-of-mass energies of 7, 8 and 13 $\mathrm{Te\kern -0.1em V}$. The decay parameters and the associated charge-parity ($C\!P$) asymmetries are measured, with no significant $C\!P$ violation observed. For the first time, the $\itΛ^\mathrm{0}_b \to \itΛ_c^+ h^-$ decay parameters are measured. The most precise measurements of the decay parameters $α, β$ and $γ$ are obtained for $\itΛ_c^+$ decays and an independent measurement of the decay parameters for the strange-baryon $\itΛ$ decay is provided. The results deepen our understanding of weak decay dynamics in baryon decays.
△ Less
Submitted 4 September, 2024;
originally announced September 2024.
-
Search for the massless dark photon with $D^0\toωγ'$ and $D^0\toγγ'$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (648 additional authors not shown)
Abstract:
Using $7.9~\rm{fb^{-1}}$ of $e^+e^-$ collision data collected at $\sqrt{s}=3.773$ GeV with the BESIII detector at the BEPCII collider, we search for the massless dark photon with the flavor-changing neutral current processes $D^0\toωγ'$ and $D^0\toγγ'$ for the first time. No significant signals are observed, and the upper limits at the 90% confidence level on the massless dark photon branching fra…
▽ More
Using $7.9~\rm{fb^{-1}}$ of $e^+e^-$ collision data collected at $\sqrt{s}=3.773$ GeV with the BESIII detector at the BEPCII collider, we search for the massless dark photon with the flavor-changing neutral current processes $D^0\toωγ'$ and $D^0\toγγ'$ for the first time. No significant signals are observed, and the upper limits at the 90% confidence level on the massless dark photon branching fraction are set to be $1.1\times10^{-5}$ and $2.0\times10^{-6}$ for $D^0\toωγ'$ and $D^0\toγγ'$, respectively. These results provide the most stringent constraint on the new physics energy scale associated with $cuγ'$ coupling in the world, with the new physics energy scale related parameter $|\mathbb{C}|^2+|\mathbb{C}_5|^2<8.2\times10^{-17}~\rm{GeV}^{-2}$ at the 90% confidence level.
△ Less
Submitted 14 October, 2024; v1 submitted 4 September, 2024;
originally announced September 2024.
-
Cooperative Global $\mathcal{K}$-exponential Tracking Control of Multiple Mobile Robots -- Extended Version
Authors:
Liang Xu,
Youfeng Su,
He Cai
Abstract:
This paper studies the cooperative tracking control problem for multiple mobile robots over a directed communication network. First, it is shown that the closed-loop system is uniformly globally asymptotically stable under the proposed distributed continuous feedback control law, where an explicit strict Lyapunov function is constructed. Then, by investigating the convergence rate, it is further p…
▽ More
This paper studies the cooperative tracking control problem for multiple mobile robots over a directed communication network. First, it is shown that the closed-loop system is uniformly globally asymptotically stable under the proposed distributed continuous feedback control law, where an explicit strict Lyapunov function is constructed. Then, by investigating the convergence rate, it is further proven that the closed-loop system is globally $\mathcal{K}$-exponentially stable. Moreover, to make the proposed control law more practical, the distributed continuous feedback control law is generalized to a distributed sampled-data feedback control law using the emulation approach, based on the strong integral input-to-state stable Lyapunov function. Numerical simulations are presented to validate the effectiveness of the proposed control methods.
△ Less
Submitted 3 September, 2024;
originally announced September 2024.
-
Study of $D^{+} \to K_{S}^{0}K^{*}(892)^{+}$ in $D^{+} \to K_{S}^{0} K_{S}^{0} π^{+}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (638 additional authors not shown)
Abstract:
Using a data sample of $e^+e^-$ collisions corresponding to an integrated luminosity of 7.93 $\rm fb^{-1}$ collected with the BESIII detector at the center-of-mass energy 3.773~GeV, we perform the first amplitude analysis of the decay $D^{+} \to K_{S}^{0} K_{S}^{0} π^{+}$. The absolute branching fraction of $D^{+} \to K_{S}^{0}K_{S}^{0} π^{+}$ is measured to be…
▽ More
Using a data sample of $e^+e^-$ collisions corresponding to an integrated luminosity of 7.93 $\rm fb^{-1}$ collected with the BESIII detector at the center-of-mass energy 3.773~GeV, we perform the first amplitude analysis of the decay $D^{+} \to K_{S}^{0} K_{S}^{0} π^{+}$. The absolute branching fraction of $D^{+} \to K_{S}^{0}K_{S}^{0} π^{+}$ is measured to be $(2.97 \pm 0.09_{\rm stat.} \pm 0.05_{\rm syst.})\times10^{-3}$. The dominant intermediate process is $D^{+} \to K_{S}^{0}K^{*}(892)^{+}$, whose branching fraction is determined to be $(8.72 \pm 0.28_{\rm stat.} \pm 0.15_{\rm syst.}) \times 10^{-3}$, including all the $K^*(892)^+$ decays.
△ Less
Submitted 2 September, 2024;
originally announced September 2024.
-
Measurement of $C\!P$ violation observables in $D^+\rightarrow K^-K^+π^+$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1109 additional authors not shown)
Abstract:
A search for violation of the charge-parity $C\!P$ symmetry in the $D^+\rightarrow K^-K^+π^+$ decay is presented, with proton-proton collision data corresponding to an integrated luminosity of 5.4 fb$^{-1}$, collected at a center-of-mass energy of $13$ TeV with the LHCb detector. A novel model-independent technique is used to compare the $D^+$ and $D^-$ phase-space distributions, with instrumental…
▽ More
A search for violation of the charge-parity $C\!P$ symmetry in the $D^+\rightarrow K^-K^+π^+$ decay is presented, with proton-proton collision data corresponding to an integrated luminosity of 5.4 fb$^{-1}$, collected at a center-of-mass energy of $13$ TeV with the LHCb detector. A novel model-independent technique is used to compare the $D^+$ and $D^-$ phase-space distributions, with instrumental asymmetries subtracted using the $D^+_{s}\rightarrow K^-K^+π^+$ decay as a control channel. The $p$-value for the hypothesis of $C\!P$ conservation is $8.1\%$. The $C\!P$ asymmetry observables $A_{C\!P|S}^{φπ^+} = (0.95 \pm 0.43_{stat} \pm 0.26_{syst})\times 10^{-3}$ and $A_{C\!P|S}^{\overline{K}^{*0}K^+} = (-0.26 \pm 0.56_{ stat} \pm 0.18_{syst})\times 10^{-3}$ are also measured. These results show no evidence of $C\!P$ violation and represent the most sensitive search performed through the phase space of a multibody decay.
△ Less
Submitted 2 September, 2024;
originally announced September 2024.
-
Correlating Time Series with Interpretable Convolutional Kernels
Authors:
Xinyu Chen,
HanQin Cai,
Fuqiang Liu,
Jinhua Zhao
Abstract:
This study addresses the problem of convolutional kernel learning in univariate, multivariate, and multidimensional time series data, which is crucial for interpreting temporal patterns in time series and supporting downstream machine learning tasks. First, we propose formulating convolutional kernel learning for univariate time series as a sparse regression problem with a non-negative constraint,…
▽ More
This study addresses the problem of convolutional kernel learning in univariate, multivariate, and multidimensional time series data, which is crucial for interpreting temporal patterns in time series and supporting downstream machine learning tasks. First, we propose formulating convolutional kernel learning for univariate time series as a sparse regression problem with a non-negative constraint, leveraging the properties of circular convolution and circulant matrices. Second, to generalize this approach to multivariate and multidimensional time series data, we use tensor computations, reformulating the convolutional kernel learning problem in the form of tensors. This is further converted into a standard sparse regression problem through vectorization and tensor unfolding operations. In the proposed methodology, the optimization problem is addressed using the existing non-negative subspace pursuit method, enabling the convolutional kernel to capture temporal correlations and patterns. To evaluate the proposed model, we apply it to several real-world time series datasets. On the multidimensional rideshare and taxi trip data from New York City and Chicago, the convolutional kernels reveal interpretable local correlations and cyclical patterns, such as weekly seasonality. In the context of multidimensional fluid flow data, both local and nonlocal correlations captured by the convolutional kernels can reinforce tensor factorization, leading to performance improvements in fluid flow reconstruction tasks. Thus, this study lays an insightful foundation for automatically learning convolutional kernels from time series data, with an emphasis on interpretability through sparsity and non-negativity constraints.
△ Less
Submitted 2 September, 2024;
originally announced September 2024.
-
Forecasts on Anisotropic Cosmic Birefringence Constraints for CMB Experiment in the Northern Hemisphere
Authors:
Yiwei Zhong,
Hongbo Cai,
Si-Yu Li,
Yang Liu,
Mingzhe Li,
Wenjuan Fang
Abstract:
The study of cosmic birefringence through Cosmic Microwave Background (CMB) experiments is a key research area in cosmology and particle physics, providing a critical test for Lorentz and CPT symmetries. This paper focuses on an upcoming CMB experiment in the mid-latitude of the Northern Hemisphere, and investigates the potential to detect anisotropies in cosmic birefringence. Applying a quadratic…
▽ More
The study of cosmic birefringence through Cosmic Microwave Background (CMB) experiments is a key research area in cosmology and particle physics, providing a critical test for Lorentz and CPT symmetries. This paper focuses on an upcoming CMB experiment in the mid-latitude of the Northern Hemisphere, and investigates the potential to detect anisotropies in cosmic birefringence. Applying a quadratic estimator on simulated polarization data, we reconstruct the power spectrum of anisotropic cosmic birefringence successfully and estimate constraints on the amplitude of the spectrum, $A_{\mathrm{CB}}$, assuming scale invariance. The forecast is based on a wide-scan observation strategy during winter, yielding an effective sky coverage of approximately 23.6%. We consider two noise scenarios corresponding to the short-term and long-term phases of the experiment. Our results show that with a small aperture telescope operating at 95/150GHz, the $2σ$ upper bound for $A_{\mathrm{CB}}$ can reach 0.017 under the low noise scenario when adopting the method of merging multi-frequency data in map domain, and merging multi-frequency data in spectrum domain tightens the limit by about 10%.A large-aperture telescope with the same bands is found to be more effective, tightening the $2σ$ upper limit to 0.0062.
△ Less
Submitted 2 September, 2024;
originally announced September 2024.
-
Measurement of Born cross sections of $e^+e^-\toΞ^0\barΞ^0$ and search for charmonium(-like) states at $\sqrt{s}$ = 3.51-4.95 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (648 additional authors not shown)
Abstract:
Using $e^+e^-$ collision data collected by the BESIII detector at BEPCII corresponding to an integrated luminosity of 30 $\rm fb^{-1}$, we measure Born cross sections and effective form factors for the process $e^+e^-\toΞ^0\barΞ^0$ at forty-five center-of-mass energies between 3.51 and 4.95 GeV. The dressed cross section is fitted, assuming a power-law function plus a charmonium(-like) state, i.e.…
▽ More
Using $e^+e^-$ collision data collected by the BESIII detector at BEPCII corresponding to an integrated luminosity of 30 $\rm fb^{-1}$, we measure Born cross sections and effective form factors for the process $e^+e^-\toΞ^0\barΞ^0$ at forty-five center-of-mass energies between 3.51 and 4.95 GeV. The dressed cross section is fitted, assuming a power-law function plus a charmonium(-like) state, i.e., $ψ(3770)$, $ψ(4040)$, $ψ(4160)$, $ψ(4230)$, $ψ(4360)$, $ψ(4415)$ or $ψ(4660)$. No significant charmonium(-like) state decaying into $Ξ^0\barΞ^0$ is observed. Upper limits at the 90% confidence level on the product of the branching fraction and the electronic partial width are provided for each decay. In addition, ratios of the Born cross sections and the effective form factors for $e^+e^-\toΞ^0\barΞ^0$ and $e^+e^-\toΞ^-\barΞ^+$ are also presented to test isospin symmetry and the vector meson dominance model.
△ Less
Submitted 31 August, 2024;
originally announced September 2024.