-
Field-free superconducting diode effect and magnetochiral anisotropy in FeTe0.7Se0.3 junctions with the inherent asymmetric barrier
Authors:
Shengyao Li,
Ya Deng,
Dianyi Hu,
Chao Zhu,
Zherui Yang,
Wanghao Tian,
Xueyan Wang,
Ming Yue,
Qiong Wu,
Zheng Liu,
Xiao Renshaw Wang
Abstract:
Nonreciprocal electrical transport, characterized by an asymmetric relationship between current and voltage, plays a crucial role in modern electronic industries. Recent studies have extended this phenomenon to superconductors, introducing the concept of the superconducting diode effect (SDE). The SDE is characterized by unequal critical supercurrents along opposite directions. Due to the requirem…
▽ More
Nonreciprocal electrical transport, characterized by an asymmetric relationship between current and voltage, plays a crucial role in modern electronic industries. Recent studies have extended this phenomenon to superconductors, introducing the concept of the superconducting diode effect (SDE). The SDE is characterized by unequal critical supercurrents along opposite directions. Due to the requirement on broken inversion symmetry, the SDE is commonly accompanied by electrical magnetochiral anisotropy (eMCA) in the resistive state. Achieving a magnetic field-free SDE with field tunability is pivotal for advancements in superconductor devices. Conventionally, the field-free SDE has been achieved in Josephson junctions by intentionally intercalating an asymmetric barrier layer. Alternatively, internal magnetism was employed. Both approaches pose challenges in the selection of superconductors and fabrication processes, thereby impeding the development of SDE. Here, we present a field-free SDE in FeTe0.7Se0.3 (FTS) junction with eMCA, a phenomenon absent in FTS single nanosheets. The field-free property is associated with the presence of a gradient oxide layer on the upper surface of each FTS nanosheet, while the eMCA is linked to spin-splitting arising from the absence of inversion symmetry. Both the SDE and eMCA respond to magnetic fields with distinct temperature dependencies. This work presents a versatile and straightforward strategy for advancing superconducting electronics.
△ Less
Submitted 16 October, 2024;
originally announced October 2024.
-
GA-NIFS & EIGER: A merging quasar host at z=7 with an overmassive black hole
Authors:
Madeline A. Marshall,
Minghao Yue,
Anna-Christina Eilers,
Jan Scholtz,
Michele Perna,
Chris J. Willott,
Roberto Maiolino,
Hannah �bler,
Santiago Arribas,
Andrew J. Bunker,
Stephane Charlot,
Bruno Rodr�guez Del Pino,
Torsten B�ker,
Stefano Carniani,
Giovanni Cresci,
Francesco D'Eugenio,
Gareth C. Jones,
Giacomo Venturi,
Rongmon Bordoloi,
Daichi Kashino,
Ruari Mackenzie,
Jorryt Matthee,
Rohan Naidu,
Robert A. Simcoe
Abstract:
The James Webb Space Telescope is revolutionising our ability to understand the host galaxies and local environments of high-z quasars. Here we obtain a comprehensive understanding of the host galaxy of the z=7.08 quasar J1120+0641 by combining NIRSpec integral field spectroscopy with NIRCam photometry of the host continuum emission. Our emission line maps reveal that this quasar host is undergoin…
▽ More
The James Webb Space Telescope is revolutionising our ability to understand the host galaxies and local environments of high-z quasars. Here we obtain a comprehensive understanding of the host galaxy of the z=7.08 quasar J1120+0641 by combining NIRSpec integral field spectroscopy with NIRCam photometry of the host continuum emission. Our emission line maps reveal that this quasar host is undergoing a merger with a bright companion galaxy. The quasar host and the companion have similar dynamical masses of $\sim10^{10}M_\odot$, suggesting that this is a major galaxy interaction. Through detailed quasar subtraction and SED fitting using the NIRCam data, we obtain an estimate of the host stellar mass of $M_{\ast}\simeq2.6\times10^9M_\odot$, with $M_{*}\simeq5.0\times10^9M_\odot$ for the companion galaxy. Using the H$β$ Balmer line we estimate a virial black hole mass of $M_{\rm{BH}}\simeq1.4\times10^9 M_\odot$. Thus, J1120+0641 has an extreme black hole - stellar mass ratio of $M_{\rm{BH}}/M_\ast\simeq0.54$, which is ~3 dex larger than expected by the local scaling relations between black hole and stellar mass. J1120+0641 is powered by an overmassive black hole with the highest reported black hole - stellar mass ratio, in a quasar host that is currently undergoing a major merger -- these new insights highlight the power of JWST for measuring and understanding these extreme first quasars.
△ Less
Submitted 17 October, 2024; v1 submitted 14 October, 2024;
originally announced October 2024.
-
Towards Scalable Semantic Representation for Recommendation
Authors:
Taolin Zhang,
Junwei Pan,
Jinpeng Wang,
Yaohua Zha,
Tao Dai,
Bin Chen,
Ruisheng Luo,
Xiaoxiang Deng,
Yuan Wang,
Ming Yue,
Jie Jiang,
Shu-Tao Xia
Abstract:
With recent advances in large language models (LLMs), there has been emerging numbers of research in developing Semantic IDs based on LLMs to enhance the performance of recommendation systems. However, the dimension of these embeddings needs to match that of the ID embedding in recommendation, which is usually much smaller than the original length. Such dimension compression results in inevitable…
▽ More
With recent advances in large language models (LLMs), there has been emerging numbers of research in developing Semantic IDs based on LLMs to enhance the performance of recommendation systems. However, the dimension of these embeddings needs to match that of the ID embedding in recommendation, which is usually much smaller than the original length. Such dimension compression results in inevitable losses in discriminability and dimension robustness of the LLM embeddings, which motivates us to scale up the semantic representation. In this paper, we propose Mixture-of-Codes, which first constructs multiple independent codebooks for LLM representation in the indexing stage, and then utilizes the Semantic Representation along with a fusion module for the downstream recommendation stage. Extensive analysis and experiments demonstrate that our method achieves superior discriminability and dimension robustness scalability, leading to the best scale-up performance in recommendations.
△ Less
Submitted 12 October, 2024;
originally announced October 2024.
-
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search
Authors:
Murong Yue,
Wenlin Yao,
Haitao Mi,
Dian Yu,
Ziyu Yao,
Dong Yu
Abstract:
Enhancing the capability of large language models (LLMs) in reasoning has gained significant attention in recent years. Previous studies have demonstrated the effectiveness of various prompting strategies in aiding LLMs in reasoning (called "reasoning actions"), such as step-by-step thinking, reflecting before answering, solving with programs, and their combinations. However, these approaches ofte…
▽ More
Enhancing the capability of large language models (LLMs) in reasoning has gained significant attention in recent years. Previous studies have demonstrated the effectiveness of various prompting strategies in aiding LLMs in reasoning (called "reasoning actions"), such as step-by-step thinking, reflecting before answering, solving with programs, and their combinations. However, these approaches often applied static, predefined reasoning actions uniformly to all questions, without considering the specific characteristics of each question or the capability of the task-solving LLM. In this paper, we propose DOTS, an approach enabling LLMs to reason dynamically via optimal reasoning trajectory search, tailored to the specific characteristics of each question and the inherent capability of the task-solving LLM. Our approach involves three key steps: i) defining atomic reasoning action modules that can be composed into various reasoning action trajectories; ii) searching for the optimal action trajectory for each training question through iterative exploration and evaluation for the specific task-solving LLM; and iii) using the collected optimal trajectories to train an LLM to plan for the reasoning trajectories of unseen questions. In particular, we propose two learning paradigms, i.e., fine-tuning an external LLM as a planner to guide the task-solving LLM, or directly fine-tuning the task-solving LLM with an internalized capability for reasoning actions planning. Our experiments across eight reasoning tasks show that our method consistently outperforms static reasoning techniques and the vanilla instruction tuning approach. Further analysis reveals that our method enables LLMs to adjust their computation based on problem complexity, allocating deeper thinking and reasoning to harder problems.
△ Less
Submitted 4 October, 2024;
originally announced October 2024.
-
A SPectroscopic survey of biased halos In the Reionization Era (ASPIRE): JWST Supports Earlier Reionization around [OIII] Emitters
Authors:
Xiangyu Jin,
Jinyi Yang,
Xiaohui Fan,
Feige Wang,
Koki Kakiichi,
Romain A. Meyer,
George D. Becker,
Siwei Zou,
Eduardo Ba�ados,
Jaclyn B. Champagne,
Valentina D'Odorico,
Minghao Yue,
Sarah E. I. Bosman,
Zheng Cai,
Anna-Christina Eilers,
Joseph F. Hennawi,
Hyunsung D. Jun,
Mingyu Li,
Zihao Li,
Weizhe Liu,
Maria Pudoka,
Sindhu Satyavolu,
Fengwu Sun,
Wei Leong Tee,
Yunjing Wu
Abstract:
Understanding when and how reionization happened is crucial for studying the early structure formation and the properties of first galaxies in the Universe. At $z>5.5$, the observed IGM optical depth shows a significant scatter, indicating an inhomogeneous reionization process. However, the nature of the inhomogeneous reionization remains debated. ASPIRE is a JWST Cycle 1 program that has spectros…
▽ More
Understanding when and how reionization happened is crucial for studying the early structure formation and the properties of first galaxies in the Universe. At $z>5.5$, the observed IGM optical depth shows a significant scatter, indicating an inhomogeneous reionization process. However, the nature of the inhomogeneous reionization remains debated. ASPIRE is a JWST Cycle 1 program that has spectroscopically identified $>400$ [OIII] emitters in 25 quasar fields at $z>6.5$. Combined with deep ground-based optical spectroscopy of ASPIRE quasars, ASPIRE program provides the current largest sample for IGM-galaxy connection studies during cosmic reionization. We present the first results of IGM effective optical depth measurements around [OIII] emitters using 14 ASPIRE quasar fields. We find the IGM transmission is tightly related with reionization-era galaxies to the extent that significant excess of Ly$α$ transmission exists around [OIII] emitters. We measure the stacked IGM effective optical depth of IGM patches associated with [OIII] emitters and find they reach the same IGM effective optical depth at least dz~0.1 ahead of those IGM patches where no [OIII] emitters are detected, supporting earlier reionization around [OIII] emitters. Our results indicate an enhancement in IGM Ly$α$ transmission around [OIII] emitters at scales beyond 25 $h^{-1}$ cMpc, consistent with the predicted topology of reionization from fluctuating UV background (UVB) models.
△ Less
Submitted 2 October, 2024;
originally announced October 2024.
-
Discovery and Characterization of Cross-Area and Intra-Area SSOs Sensitive to Delay in Droop Control of Grid-Forming Converters
Authors:
Lilan Karunaratne,
Nilanjan Ray Chaudhuri,
Amirthagunaraj Yogarathnam,
Meng Yue
Abstract:
Subsynchronous oscillations (SSOs) involving grid-forming converters (GFCs) are in a less familiar territory of power system dynamics. This letter reports a new phenomenon namely cross-area SSOs in grids with 100% droop-controlled GFC-based renewable penetration, which was discovered during our study on evaluating the adequacy of quasistationary phasor calculus (QPC) and space phasor calculus (SPC…
▽ More
Subsynchronous oscillations (SSOs) involving grid-forming converters (GFCs) are in a less familiar territory of power system dynamics. This letter reports a new phenomenon namely cross-area SSOs in grids with 100% droop-controlled GFC-based renewable penetration, which was discovered during our study on evaluating the adequacy of quasistationary phasor calculus (QPC) and space phasor calculus (SPC)-based models in capturing SSOs. We present frequency-domain characterization of such oscillatory modes in addition to intra-area SSOs in grids involving GFCs and study the impact of a delay in power-frequency droop feedback loop in regards to their stability. Electromagnetic transient (EMT) simulations validate our findings.
△ Less
Submitted 15 September, 2024;
originally announced September 2024.
-
Randomized Submanifold Subgradient Method for Optimization over Stiefel Manifolds
Authors:
Andy Yat-Ming Cheung,
Jinxin Wang,
Man-Chung Yue,
Anthony Man-Cho So
Abstract:
Optimization over Stiefel manifolds has found wide applications in many scientific and engineering domains. Despite considerable research effort, high-dimensional optimization problems over Stiefel manifolds remain challenging, and the situation is exacerbated by nonsmooth objective functions. The purpose of this paper is to propose and study a novel coordinate-type algorithm for weakly convex (po…
▽ More
Optimization over Stiefel manifolds has found wide applications in many scientific and engineering domains. Despite considerable research effort, high-dimensional optimization problems over Stiefel manifolds remain challenging, and the situation is exacerbated by nonsmooth objective functions. The purpose of this paper is to propose and study a novel coordinate-type algorithm for weakly convex (possibly nonsmooth) optimization problems over high-dimensional Stiefel manifolds, named randomized submanifold subgradient method (RSSM). Similar to coordinate-type algorithms in the Euclidean setting, RSSM exhibits low per-iteration cost and is suitable for high-dimensional problems. We prove that RSSM converges to the set of stationary points and attains $\varepsilon$-stationary points with respect to a natural stationarity measure in $\mathcal{O}(\varepsilon^{-4})$ iterations in both expectation and the almost-sure senses. To the best of our knowledge, these are the first convergence guarantees for coordinate-type algorithms to optimize nonconvex nonsmooth functions over Stiefel manifolds. An important technical tool in our convergence analysis is a new Riemannian subgradient inequality for weakly convex functions on proximally smooth matrix manifolds, which could be of independent interest.
△ Less
Submitted 3 September, 2024;
originally announced September 2024.
-
A Case Study on Modeling Adequacy of a Grid with Subsynchronous Oscillations Involving IBRs
Authors:
Lilan Karunaratne,
Nilanjan Ray Chaudhuri,
Amirthagunaraj Yogarathnam,
Meng Yue
Abstract:
A case study on modeling adequacy of a grid in presence of renewable resources based on grid-forming converters (GFCs) is the subject matter of this paper. For this purpose, a 4-machine 11-bus IEEE benchmark model is modified by considering GFCs replacing synchronous generators that led to unstable subsynchronous oscillations (SSOs). We aim to: (a) understand if transmission network dynamics shoul…
▽ More
A case study on modeling adequacy of a grid in presence of renewable resources based on grid-forming converters (GFCs) is the subject matter of this paper. For this purpose, a 4-machine 11-bus IEEE benchmark model is modified by considering GFCs replacing synchronous generators that led to unstable subsynchronous oscillations (SSOs). We aim to: (a) understand if transmission network dynamics should be considered in such cases, (b) revisit the space-phasor-calculus (SPC) in d-q frame under balanced condition that captures such phenomena and lends itself to eigenvalue analysis, and (c) emphasize limitations of such models while underscoring their importance for large-scale power system simulations. Time-domain and frequency-domain results from SPC and quasistationary phasor calculus (QPC) models are compared with electromagnetic transient (EMT)-based simulations. It is shown that models with transmission line dynamics in SPC framework can capture the SSO mode while QPC models that neglect these dynamics fail to do so.
△ Less
Submitted 24 June, 2024;
originally announced July 2024.
-
A Geometric Unification of Distributionally Robust Covariance Estimators: Shrinking the Spectrum by Inflating the Ambiguity Set
Authors:
Man-Chung Yue,
Yves Rychener,
Daniel Kuhn,
Viet Anh Nguyen
Abstract:
The state-of-the-art methods for estimating high-dimensional covariance matrices all shrink the eigenvalues of the sample covariance matrix towards a data-insensitive shrinkage target. The underlying shrinkage transformation is either chosen heuristically - without compelling theoretical justification - or optimally in view of restrictive distributional assumptions. In this paper, we propose a pri…
▽ More
The state-of-the-art methods for estimating high-dimensional covariance matrices all shrink the eigenvalues of the sample covariance matrix towards a data-insensitive shrinkage target. The underlying shrinkage transformation is either chosen heuristically - without compelling theoretical justification - or optimally in view of restrictive distributional assumptions. In this paper, we propose a principled approach to construct covariance estimators without imposing restrictive assumptions. That is, we study distributionally robust covariance estimation problems that minimize the worst-case Frobenius error with respect to all data distributions close to a nominal distribution, where the proximity of distributions is measured via a divergence on the space of covariance matrices. We identify mild conditions on this divergence under which the resulting minimizers represent shrinkage estimators. We show that the corresponding shrinkage transformations are intimately related to the geometrical properties of the underlying divergence. We also prove that our robust estimators are efficiently computable and asymptotically consistent and that they enjoy finite-sample performance guarantees. We exemplify our general methodology by synthesizing explicit estimators induced by the Kullback-Leibler, Fisher-Rao, and Wasserstein divergences. Numerical experiments based on synthetic and real data show that our robust estimators are competitive with state-of-the-art estimators.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
MAMMOTH-Subaru. II. Diverse Populations of Circumgalactic Ly$α$ Nebulae at Cosmic Noon
Authors:
Mingyu Li,
Haibin Zhang,
Zheng Cai,
Yongming Liang,
Nobunari Kashikawa,
Ke Ma,
Xiaohui Fan,
J. Xavier Prochaska,
Bjorn H. C. Emonts,
Xin Wang,
Yunjing Wu,
Shiwu Zhang,
Qiong Li,
Sean D. Johnson,
Minghao Yue,
Fabrizio Arrigoni Battaia,
Sebastiano Cantalupo,
Joseph F. Hennawi,
Satoshi Kikuta,
Yuanhang Ning,
Masami Ouchi,
Rhythm Shimakawa,
Ben Wang,
Weichen Wang,
Zheng Zheng
, et al. (1 additional authors not shown)
Abstract:
Circumgalactic Lyman-alpha (Ly$α$) nebulae are gaseous halos around galaxies exhibiting luminous extended Ly$α$ emission. This work investigates Ly$α$ nebulae from deep imaging of $\sim12~\mathrm{deg}^2$ sky, targeted by the MAMMOTH-Subaru survey. Utilizing the wide-field capability of Hyper Suprime-Cam (HSC), we present one of the largest blind Ly$α$ nebula selections, including QSO nebulae, Ly…
▽ More
Circumgalactic Lyman-alpha (Ly$α$) nebulae are gaseous halos around galaxies exhibiting luminous extended Ly$α$ emission. This work investigates Ly$α$ nebulae from deep imaging of $\sim12~\mathrm{deg}^2$ sky, targeted by the MAMMOTH-Subaru survey. Utilizing the wide-field capability of Hyper Suprime-Cam (HSC), we present one of the largest blind Ly$α$ nebula selections, including QSO nebulae, Ly$α$ blobs, and radio galaxy nebulae down to typical $2σ$ Ly$α$ surface brightness of $(5-10)\times10^{-18}\mathrm{~erg~s^{-1}~cm^{-2}~arcsec^{-2}}$. The sample contains 117 nebulae with Ly$α$ sizes of 40 - 400 kpc, and the most gigantic one spans about 365 kpc, referred to as the Ivory Nebula. Combining multiwavelength data, we investigate diverse nebula populations and associated galaxies. We find a small fraction of Ly$α$ nebulae have QSOs ($\sim7\%$), luminous infrared galaxies ($\sim1\%$), and radio galaxies ($\sim 2\%$). Remarkably, among the 28 enormous Ly$α$ nebulae (ELANe) exceeding 100 kpc, about 80\% are associated with UV-faint galaxies ($M_\mathrm{UV} > -22$), categorized as Type II ELANe. We underscore that Type II ELANe constitute the majority but remain largely hidden in current galaxy and QSO surveys. Dusty starburst and obscured AGN activity are proposed to explain the nature of Type II ELANe. The SED of stacking all Ly$α$ nebulae also reveals signs of massive dusty star-forming galaxies with obscured AGNs. We propose a model to explain the dusty nature where the diverse populations of Ly$α$ nebulae capture massive galaxies at different evolutionary stages undergoing violent assembling. Ly$α$ nebulae provide critical insights into the formation and evolution of today's massive cluster galaxies at cosmic noon.
△ Less
Submitted 26 September, 2024; v1 submitted 21 May, 2024;
originally announced May 2024.
-
Subdifferentially polynomially bounded functions and Gaussian smoothing-based zeroth-order optimization
Authors:
Ming Lei,
Ting Kei Pong,
Shuqin Sun,
Man-Chung Yue
Abstract:
We introduce the class of subdifferentially polynomially bounded (SPB) functions, which is a rich class of locally Lipschitz functions that encompasses all Lipschitz functions, all gradient- or Hessian-Lipschitz functions, and even some non-smooth locally Lipschitz functions. We show that SPB functions are compatible with Gaussian smoothing (GS), in the sense that the GS of any SPB function is wel…
▽ More
We introduce the class of subdifferentially polynomially bounded (SPB) functions, which is a rich class of locally Lipschitz functions that encompasses all Lipschitz functions, all gradient- or Hessian-Lipschitz functions, and even some non-smooth locally Lipschitz functions. We show that SPB functions are compatible with Gaussian smoothing (GS), in the sense that the GS of any SPB function is well-defined and satisfies a descent lemma akin to gradient-Lipschitz functions, with the Lipschitz constant replaced by a polynomial function. Leveraging this descent lemma, we propose GS-based zeroth-order optimization algorithms with an adaptive stepsize strategy for constrained minimization of SPB functions, and analyze their iteration complexity. An important instrument in our analysis, which could be of independent interest, is the quantification of Goldstein stationarity via the GS gradient.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Multigrid method for nonlinear eigenvalue problems based on Newton iteration
Authors:
Fei Xu,
Manting Xie,
Meiling Yue
Abstract:
In this paper, a novel multigrid method based on Newton iteration is proposed to solve nonlinear eigenvalue problems. Instead of handling the eigenvalue $λ$ and eigenfunction $u$ separately, we treat the eigenpair $(λ, u)$ as one element in a product space $\mathbb R \times H_0^1(Ω)$. Then in the presented multigrid method, only one discrete linear boundary value problem needs to be solved for eac…
▽ More
In this paper, a novel multigrid method based on Newton iteration is proposed to solve nonlinear eigenvalue problems. Instead of handling the eigenvalue $λ$ and eigenfunction $u$ separately, we treat the eigenpair $(λ, u)$ as one element in a product space $\mathbb R \times H_0^1(Ω)$. Then in the presented multigrid method, only one discrete linear boundary value problem needs to be solved for each level of the multigrid sequence. Because we avoid solving large-scale nonlinear eigenvalue problems directly, the overall efficiency is significantly improved. The optimal error estimate and linear computational complexity can be derived simultaneously. In addition, we also provide an improved multigrid method coupled with a mixing scheme to further guarantee the convergence and stability of the iteration scheme. More importantly, we prove convergence for the residuals after each iteration step. For nonlinear eigenvalue problems, such theoretical analysis is missing from the existing literatures on the mixing iteration scheme.
△ Less
Submitted 29 April, 2024;
originally announced April 2024.
-
A Spatially Resolved [CII] Survey of 31 $z\sim7$ Massive Galaxies Hosting Luminous Quasars
Authors:
Feige Wang,
Jinyi Yang,
Xiaohui Fan,
Bram Venemans,
Roberto Decarli,
Eduardo Bañados,
Fabian Walter,
Aaron J. Barth,
Fuyan Bian,
Frederick B. Davies,
Anna-Christina Eilers,
Emanuele Paolo Farina,
Joseph F. Hennawi,
Jiang-Tao Li,
Chiara Mazzucchelli,
Ran Wang,
Xue-Bing Wu,
Minghao Yue
Abstract:
The [CII] 158 $μ$m emission line and the underlying far-infrared (FIR) dust continuum are important tracers for studying star formation and kinematic properties of early galaxies. We present a survey of the [CII] emission lines and FIR continua of 31 luminous quasars at $z>6.5$ using the Atacama Large Millimeter Array (ALMA) and the NOrthern Extended Millimeter Array (NOEMA) at sub-arcsec resoluti…
▽ More
The [CII] 158 $μ$m emission line and the underlying far-infrared (FIR) dust continuum are important tracers for studying star formation and kinematic properties of early galaxies. We present a survey of the [CII] emission lines and FIR continua of 31 luminous quasars at $z>6.5$ using the Atacama Large Millimeter Array (ALMA) and the NOrthern Extended Millimeter Array (NOEMA) at sub-arcsec resolution. This survey more than doubles the number of quasars with [CII] and FIR observations at these redshifts and enables statistical studies of quasar host galaxies deep into the epoch of reionization. We detect [CII] emission in 27 quasar hosts with a luminosity range of $L_{\rm [CII]}=(0.3-5.5)\times10^9~L_\odot$ and detect the FIR continuum of 28 quasar hosts with a luminosity range of $L_{\rm FIR}=(0.5-13.0)\times10^{12}~L_\odot$. Both $L_{\rm [CII]}$ and $L_{\rm FIR}$ are correlated ($ρ\simeq0.4$) with the quasar bolometric luminosity, albeit with substantial scatter. The quasar hosts detected by ALMA are clearly resolved with a median diameter of $\sim$5 kpc. About 40% of the quasar host galaxies show a velocity gradient in [CII] emission, while the rest show either dispersion-dominated or disturbed kinematics. Basic estimates of the dynamical masses of the rotation-dominated host galaxies yield $M_{\rm dyn}=(0.1-7.5)\times10^{11}~M_\odot$. Considering our findings alongside those of literature studies, we found that the ratio between $M_{\rm BH}$ and $M_{\rm dyn}$ is about ten times higher than that of local $M_{\rm BH}-M_{\rm dyn}$ relation on average but with substantial scatter (the ratio difference ranging from $\sim$0.6 to 60) and large uncertainties.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
Stacking X-ray Observations of "Little Red Dots": Implications for their AGN Properties
Authors:
Minghao Yue,
Anna-Christina Eilers,
Tonima Tasnim Ananna,
Christos Panagiotou,
Erin Kara,
Takamitsu Miyaji
Abstract:
Recent James Webb Space Telescope (JWST) observations have revealed a population of compact extragalactic objects at $z\gtrsim4$ with red near-infrared colors, which have been dubbed as ``Little Red Dots" (LRDs). The spectroscopically-selected LRDs exhibit broad H$α$ emission lines, which likely indicates that type-I active galactic nuclei (AGN) are harbored in the galaxies' dust-reddened cores. H…
▽ More
Recent James Webb Space Telescope (JWST) observations have revealed a population of compact extragalactic objects at $z\gtrsim4$ with red near-infrared colors, which have been dubbed as ``Little Red Dots" (LRDs). The spectroscopically-selected LRDs exhibit broad H$α$ emission lines, which likely indicates that type-I active galactic nuclei (AGN) are harbored in the galaxies' dust-reddened cores. However, other mechanisms, like strong outflowing winds, could also produce broad H$α$ emission lines, and thus, the nature of LRDs is still under debate. We test the AGN hypothesis for LRDs by stacking the archival {\em Chandra} observations of 34 spectroscopically-selected LRDs. We obtain tentative detections in the soft $(0.5-2\text{ keV})$ and hard $(2-8\text{ keV})$ X-ray bands with $2.9σ$ and $3.2σ$ significance, and with $4.1σ$ significance when combining the two bands. Nevertheless, we find that the soft (hard) band $3σ$ upper limit is $\sim1$dex ($\sim 0.3$dex) lower than the expected level from the $L_\text{X}-L_{\text{H}α}$ relation for typical type-I AGNs. Our results indicate that AGN activity is indeed likely present in LRDs, though these objects have significantly different properties compared to previously identified type-I AGNs, i.e., LRDs may have intrinsically weak X-ray emissions. We find it difficult to explain the low $L_\text{X}/L_{\text{H}α}$ ratios observed in LRDs solely by absorption. It is also unlikely that fast outflows have major contributions to the broad H$α$ lines. Our findings indicate that empirical relations (e.g., for black hole mass measurements) established for typical type-I AGNs should be used with caution when analyzing the properties of LRDs.
△ Less
Submitted 12 September, 2024; v1 submitted 20 April, 2024;
originally announced April 2024.
-
MathVC: An LLM-Simulated Multi-Character Virtual Classroom for Mathematics Education
Authors:
Murong Yue,
Wijdane Mifdal,
Yixuan Zhang,
Jennifer Suh,
Ziyu Yao
Abstract:
Mathematical modeling (MM) is considered a fundamental skill for students in STEM disciplines. Practicing the MM skill is often the most effective when students can engage in group discussion and collaborative problem-solving. However, due to unevenly distributed teachers and educational resources needed to monitor such group activities, students do not always receive equal opportunities for this…
▽ More
Mathematical modeling (MM) is considered a fundamental skill for students in STEM disciplines. Practicing the MM skill is often the most effective when students can engage in group discussion and collaborative problem-solving. However, due to unevenly distributed teachers and educational resources needed to monitor such group activities, students do not always receive equal opportunities for this practice. Excitingly, large language models (LLMs) have recently demonstrated strong capability in both modeling mathematical problems and simulating characters with different traits and properties. Drawing inspiration from the advancement of LLMs, in this work, we present MATHVC, the very first LLM-powered virtual classroom containing multiple LLM-simulated student characters, with whom a human student can practice their MM skill. To encourage each LLM character's behaviors to be aligned with their specified math-relevant properties (termed "characteristics alignment") and the overall conversational procedure to be close to an authentic student MM discussion (termed "conversational procedural alignment"), we proposed three innovations: integrating MM domain knowledge into the simulation, defining a symbolic schema as the ground for character simulation, and designing a meta planner at the platform level to drive the conversational procedure. Through experiments and ablation studies, we confirmed the effectiveness of our simulation approach and showed the promise for MATHVC to benefit real-life students in the future.
△ Less
Submitted 9 April, 2024;
originally announced April 2024.
-
A Max-Min-Max Algorithm for Large-Scale Robust Optimization
Authors:
Kai Tu,
Zhi Chen,
Man-Chung Yue
Abstract:
Robust optimization (RO) is a powerful paradigm for decision making under uncertainty. Existing algorithms for solving RO, including the reformulation approach and the cutting-plane method, do not scale well, hindering the application of RO to large-scale decision problems. In this paper, we devise a first-order algorithm for solving RO based on a novel max-min-max perspective. Our algorithm opera…
▽ More
Robust optimization (RO) is a powerful paradigm for decision making under uncertainty. Existing algorithms for solving RO, including the reformulation approach and the cutting-plane method, do not scale well, hindering the application of RO to large-scale decision problems. In this paper, we devise a first-order algorithm for solving RO based on a novel max-min-max perspective. Our algorithm operates directly on the model functions and sets through the subgradient and projection oracles, which enables the exploitation of problem structures and is especially suitable for large-scale RO. Theoretically, we prove that the oracle complexity of our algorithm for attaining an $\varepsilon$-approximate optimal solution is $\mathcal{O}(\varepsilon^{-3})$ or $\mathcal{O}(\varepsilon^{-2})$, depending on the smoothness of the model functions. The algorithm and its theoretical results are then extended to RO with projection-unfriendly uncertainty sets. We also show via extensive numerical experiments that the proposed algorithm outperforms the reformulation approach, the cutting-plane method and two other recent first-order algorithms.
△ Less
Submitted 8 April, 2024;
originally announced April 2024.
-
An MILP-Based Solution Scheme for Factored and Robust Factored Markov Decision Processes
Authors:
Huikang Liu,
Wolfram Wiesemann,
Man-Chung Yue
Abstract:
Factored Markov decision processes (MDPs) are a prominent paradigm within the artificial intelligence community for modeling and solving large-scale MDPs whose rewards and dynamics decompose into smaller, loosely interacting components. Through the use of dynamic Bayesian networks and context-specific independence, factored MDPs can achieve an exponential reduction in the state space of an MDP and…
▽ More
Factored Markov decision processes (MDPs) are a prominent paradigm within the artificial intelligence community for modeling and solving large-scale MDPs whose rewards and dynamics decompose into smaller, loosely interacting components. Through the use of dynamic Bayesian networks and context-specific independence, factored MDPs can achieve an exponential reduction in the state space of an MDP and thus scale to problem sizes that are beyond the reach of classical MDP algorithms. However, factored MDPs are typically solved using custom-designed algorithms that can require meticulous implementations and considerable fine-tuning. In this paper, we propose a mathematical programming approach to solving factored MDPs. In contrast to existing solution schemes, our approach leverages off-the-shelf solvers, which allows for a streamlined implementation and maintenance; it effectively capitalizes on the factored structure present in both state and action spaces; and it readily extends to the largely unexplored class of robust factored MDPs, whose transition kernels are only known to reside in a pre-specified ambiguity set. Our numerical experiments demonstrate the potential of our approach.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
A unified model for the clustering of quasars and galaxies at $z\approx6$
Authors:
Elia Pizzati,
Joseph F. Hennawi,
Joop Schaye,
Matthieu Schaller,
Anna-Christina Eilers,
Feige Wang,
Carlos S. Frenk,
Willem Elbers,
John C. Helly,
Ruari Mackenzie,
Jorryt Matthee,
Rongmon Bordoloi,
Daichi Kashino,
Rohan P. Naidu,
Minghao Yue
Abstract:
Recent observations from the EIGER JWST program have measured for the first time the quasar-galaxy cross-correlation function at $z\approx6$. The auto-correlation function of faint $z\approx6$ quasars was also recently estimated. These measurements provide key insights into the properties of quasars and galaxies at high redshift and their relation with the host dark matter halos. In this work, we…
▽ More
Recent observations from the EIGER JWST program have measured for the first time the quasar-galaxy cross-correlation function at $z\approx6$. The auto-correlation function of faint $z\approx6$ quasars was also recently estimated. These measurements provide key insights into the properties of quasars and galaxies at high redshift and their relation with the host dark matter halos. In this work, we interpret these data building upon an empirical quasar population model that has been applied successfully to quasar clustering and demographic measurements at $z\approx2-4$. We make use of a new, large-volume N-body simulation with more than a trillion particles, FLAMINGO-10k, to model quasars and galaxies simultaneously. We successfully reproduce observations of $z\approx6$ quasars and galaxies (i.e., their clustering properties and luminosity functions), and infer key quantities such as their luminosity-halo mass relation, the mass function of their host halos, and their duty cycle/occupation fraction. Our key findings are: (i) quasars reside on average in $\approx10^{12.5}\,{\rm M}_\odot$ halos (corresponding to $\approx5σ$ fluctuations in the initial conditions of the linear density field), but the distribution of host halo masses is quite broad; (ii) the duty cycle of (UV-bright) quasar activity is relatively low ($\approx1\%$); (iii) galaxies (that are bright in [OIII]) live in much smaller halos ($\approx10^{10.9}\,{\rm M}_\odot$) and have a larger duty cycle (occupation fraction) of $\approx13\%$. Finally, we focus on the inferred properties of quasars and present a homogeneous analysis of their evolution with redshift. The picture that emerges reveals a strong evolution of the host halo mass and duty cycle of quasars at $z\approx2-6$, and calls for new investigations of the role of quasar activity across cosmic time.
△ Less
Submitted 5 October, 2024; v1 submitted 18 March, 2024;
originally announced March 2024.
-
EIGER VI. The Correlation Function, Host Halo Mass and Duty Cycle of Luminous Quasars at $z\gtrsim6$
Authors:
Anna-Christina Eilers,
Ruari Mackenzie,
Elia Pizzati,
Jorryt Matthee,
Joseph F. Hennawi,
Haowen Zhang,
Rongmon Bordoloi,
Daichi Kashino,
Simon J. Lilly,
Rohan P. Naidu,
Robert A. Simcoe,
Minghao Yue,
Carlos S. Frenk,
John C. Helly,
Matthieu Schaller,
Joop Schaye
Abstract:
We expect luminous ($M_{1450}\lesssim-26.5$) high-redshift quasars to trace the highest density peaks in the early universe. Here, we present observations of four $z\gtrsim6$ quasar fields using JWST/NIRCam in imaging and widefield slitless spectroscopy mode and report a wide range in the number of detected [OIII]-emitting galaxies in the quasars' environments, ranging between a density enhancemen…
▽ More
We expect luminous ($M_{1450}\lesssim-26.5$) high-redshift quasars to trace the highest density peaks in the early universe. Here, we present observations of four $z\gtrsim6$ quasar fields using JWST/NIRCam in imaging and widefield slitless spectroscopy mode and report a wide range in the number of detected [OIII]-emitting galaxies in the quasars' environments, ranging between a density enhancement of $δ\approx65$ within a $2$ cMpc radius - one of the largest proto-clusters during the Epoch of Reionization discovered to date - to a density contrast consistent with zero, indicating the presence of a UV-luminous quasar in a region comparable to the average density of the universe. By measuring the two-point cross-correlation function of quasars and their surrounding galaxies, as well as the galaxy auto-correlation function, we infer a correlation length of quasars at $\langle z\rangle=6.25$ of $r_0^{\rm QQ}=22.0^{+3.0}_{-2.9}~{\rm cMpc}\,h^{-1}$, while we obtain a correlation length of the [OIII]-emitting galaxies of $r_0^{\rm GG}=4.1\pm0.3~{\rm cMpc}\,h^{-1}$. By comparing the correlation functions to dark-matter-only simulations we estimate the minimum mass of the quasars' host dark matter halos to be $\log_{10}(M_{\rm halo, min}/M_\odot)=12.43^{+0.13}_{-0.15}$ (and $\log_{10}(M_{\rm halo, min}^{\rm [OIII]}/M_\odot) = 10.56^{+0.05}_{-0.03}$ for the [OIII]-emitters), indicating that (a) luminous quasars do not necessarily reside within the most overdense regions in the early universe, and that (b) the UV-luminous duty cycle of quasar activity at these redshifts is $f_{\rm duty}\ll1$. Such short quasar activity timescales challenge our understanding of early supermassive black hole growth and provide evidence for highly dust-obscured growth phases or episodic, radiatively inefficient accretion rates.
△ Less
Submitted 4 September, 2024; v1 submitted 12 March, 2024;
originally announced March 2024.
-
An Approach to Evaluate Modeling Adequacy for Small-Signal Stability Analysis of IBR-related SSOs in Multimachine Systems
Authors:
Lilan Karunaratne,
Nilanjan Ray Chaudhuri,
Amirthagunaraj Yogarathnam,
Meng Yue
Abstract:
Time-varying phasor-based analysis of subsynchronous oscillations (SSOs) involving grid-following converters (GFLCs) and its benchmarking with electromagnetic transient (EMT) models have so far been restricted to highly simplified grid models with constant voltage sources behind series R-L circuits. In this paper, modeling adequacy of bulk power systems with synchronous generators (SGs), transmiss…
▽ More
Time-varying phasor-based analysis of subsynchronous oscillations (SSOs) involving grid-following converters (GFLCs) and its benchmarking with electromagnetic transient (EMT) models have so far been restricted to highly simplified grid models with constant voltage sources behind series R-L circuits. In this paper, modeling adequacy of bulk power systems with synchronous generators (SGs), transmission systems, loads, and GFLCs are considered. To this end, we revisit the notions of time-varying phasor calculus, highlighting the distinction between space-phasor-calculus (SPC) and two often interchangeably used frameworks namely baseband-abc and generalized averaging. We present the models of grids in SPC framework that include transmission line dynamics, load dynamics, and SG stator transients. Next, we propose a generic approach to study modeling adequacy in small-signal sense by (a) identifying critical modes through eigenvalue and singular value analysis followed by (b) using weighted maximum singular value error magnitudes as metrics, and (c) further cross-validation. Using a modified 4-machine IEEE benchmark model with up to 3 GFLCs we show that SPC framework can be used for analysis of SSOs. Further, we consider the quasistationary phasor calculus (QPC) framework that neglects transmission line, load, and SG stator dynamics to show its adequacy in SSO modeling and analysis. Time-domain and frequency-domain results with EMT models are also presented.
△ Less
Submitted 24 June, 2024; v1 submitted 12 March, 2024;
originally announced March 2024.
-
AMUSE: Adaptive Multi-Segment Encoding for Dataset Watermarking
Authors:
Saeed Ranjbar Alvar,
Mohammad Akbari,
David Ming Xuan Yue,
Yong Zhang
Abstract:
Curating high quality datasets that play a key role in the emergence of new AI applications requires considerable time, money, and computational resources. So, effective ownership protection of datasets is becoming critical. Recently, to protect the ownership of an image dataset, imperceptible watermarking techniques are used to store ownership information (i.e., watermark) into the individual ima…
▽ More
Curating high quality datasets that play a key role in the emergence of new AI applications requires considerable time, money, and computational resources. So, effective ownership protection of datasets is becoming critical. Recently, to protect the ownership of an image dataset, imperceptible watermarking techniques are used to store ownership information (i.e., watermark) into the individual image samples. Embedding the entire watermark into all samples leads to significant redundancy in the embedded information which damages the watermarked dataset quality and extraction accuracy. In this paper, a multi-segment encoding-decoding method for dataset watermarking (called AMUSE) is proposed to adaptively map the original watermark into a set of shorter sub-messages and vice versa. Our message encoder is an adaptive method that adjusts the length of the sub-messages according to the protection requirements for the target dataset. Existing image watermarking methods are then employed to embed the sub-messages into the original images in the dataset and also to extract them from the watermarked images. Our decoder is then used to reconstruct the original message from the extracted sub-messages. The proposed encoder and decoder are plug-and-play modules that can easily be added to any watermarking method. To this end, extensive experiments are preformed with multiple watermarking solutions which show that applying AMUSE improves the overall message extraction accuracy upto 28% for the same given dataset quality. Furthermore, the image dataset quality is enhanced by a PSNR of $\approx$2 dB on average, while improving the extraction accuracy for one of the tested image watermarking methods.
△ Less
Submitted 18 July, 2024; v1 submitted 8 March, 2024;
originally announced March 2024.
-
A Massive Protocluster Anchored by a Luminous Quasar at $z=6.63$
Authors:
Feige Wang,
Jinyi Yang,
Joseph F. Hennawi,
Xiaohui Fan,
Minghao Yue,
Eduardo Bañados,
Shane Bechtel,
Fuyan Bian,
Sarah Bosman,
Jaclyn B. Champagne,
Frederick B. Davies,
Roberto Decarli,
Emanuele Paolo Farina,
Chiara Mazzucchelli,
Bram Venemans,
Fabian Walter
Abstract:
Protoclusters, the progenitors of galaxy clusters, trace large scale structures in the early Universe and are important to our understanding of structure formation and galaxy evolution. To date, only a handful of protoclusters have been identified in the Epoch of Reionization (EoR). As one of the rarest populations in the early Universe, distant quasars that host active supermassive black holes ar…
▽ More
Protoclusters, the progenitors of galaxy clusters, trace large scale structures in the early Universe and are important to our understanding of structure formation and galaxy evolution. To date, only a handful of protoclusters have been identified in the Epoch of Reionization (EoR). As one of the rarest populations in the early Universe, distant quasars that host active supermassive black holes are thought to reside in the most massive dark matter halos at that cosmic epoch, and could thus potentially pinpoint some of the earliest protoclusters. In this letter, we report the discovery of a massive protocluster around a luminous quasar at $z=6.63$. This protocluster is anchored by the quasar, and includes three [CII] emitters at $z\sim6.63$, 12 spectroscopically confirmed Ly$α$ emitters (LAEs) at $6.54<z\le6.64$, and a large number of narrow-band imaging selected LAE candidates at the same redshift. This structure has an overall overdensity of $δ=3.3^{+1.1}_{-0.9}$ within $\sim35\times74$ cMpc$^2$ on the sky and an extreme overdensity of $δ>30$ in its central region (i.e., $R\lesssim2$ cMpc). We estimate that this protocluster will collapse into a galaxy cluster with a mass of $6.9^{+1.2}_{-1.4}\times10^{15}~M_\odot$ at the current epoch, more massive than the most massive clusters known in the local Universe such as Coma. In the quasar vicinity, we discover a double-peaked LAE which implies that the quasar has a UV lifetime greater than 0.8 Myrs and has already ionized its surrounding intergalactic medium.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Evaluating User Experience and Data Quality in Gamified Data Collection for Appearance-Based Gaze Estimation
Authors:
Mingtao Yue,
Tomomi Sayuda,
Miles Pennington,
Yusuke Sugano
Abstract:
Appearance-based gaze estimation, which uses only a regular camera to estimate human gaze, is important in various application fields. While the technique faces data bias issues, data collection protocol is often demanding, and collecting data from a wide range of participants is difficult. It is an important challenge to design opportunities that allow a diverse range of people to participate whi…
▽ More
Appearance-based gaze estimation, which uses only a regular camera to estimate human gaze, is important in various application fields. While the technique faces data bias issues, data collection protocol is often demanding, and collecting data from a wide range of participants is difficult. It is an important challenge to design opportunities that allow a diverse range of people to participate while ensuring the quality of the training data. To tackle this challenge, we introduce a novel gamified approach for collecting training data. In this game, two players communicate words via eye gaze through a transparent letter board. Images captured during gameplay serve as valuable training data for gaze estimation models. The game is designed as a physical installation that involves communication between players, and it is expected to attract the interest of diverse participants. We assess the game's significance on data quality and user experience through a comparative user study.
△ Less
Submitted 2 September, 2024; v1 submitted 25 January, 2024;
originally announced January 2024.
-
XMM-Newton-discovered Fast X-ray Transients: Host galaxies and limits on contemporaneous detections of optical counterparts
Authors:
D. Eappachen,
P. G. Jonker,
J. Quirola-Vásquez,
D. Mata Sánchez,
A. Inkenhaag,
A. J. Levan,
M. Fraser,
M. A. P. Torres,
F. E. Bauer,
A. A. Chrimes,
D. Stern,
M. J. Graham,
S. J. Smartt,
K. W. Smith,
M. E. Ravasio,
A. I. Zabludoff,
M. Yue,
F. Stoppa,
D. B. Malesani,
N. C. Stone,
S. Wen
Abstract:
Extragalactic fast X-ray transients (FXTs) are a class of soft (0.3-10 keV) X-ray transients lasting a few hundred seconds to several hours. Several progenitor mechanisms have been suggested to produce FXTs, including supernova shock breakouts, binary neutron star mergers, or tidal disruptions involving an intermediate-mass black hole and a white dwarf. We present detailed host studies, including…
▽ More
Extragalactic fast X-ray transients (FXTs) are a class of soft (0.3-10 keV) X-ray transients lasting a few hundred seconds to several hours. Several progenitor mechanisms have been suggested to produce FXTs, including supernova shock breakouts, binary neutron star mergers, or tidal disruptions involving an intermediate-mass black hole and a white dwarf. We present detailed host studies, including spectroscopic observations of the host galaxies of 7 XMM-Newton-discovered FXTs. The candidate hosts lie at redshifts 0.0928 $< z <$ 0.645 implying peak X-ray luminosities of 10$^{43}$ erg s$^{-1}$ $< L_X <$ 10$^{45}$ erg s$^{-1}$,and physical offsets of 1 kpc < $r_\mathrm{proj}$ < 22 kpc. These observations increase the number of FXTs with a spectroscopic redshift measurement by a factor of 2, although we note that one event is re-identified as a Galactic flare star. We infer host star formation rates and stellar masses by fitting the combined spectroscopic and archival photometric data. We also report on a contemporaneous optical counterpart search to the FXTs in Pan-STARRS and ATLAS by performing forced photometry at the position of the FXTs. We do not find any counterpart in our search. Given our constraints, including peak X-ray luminosities, optical limits, and host properties, we find that XRT 110621 is consistent with a SN SBO event. Spectroscopic redshifts of likely host galaxies for four events imply peak X-ray luminosities that are too high to be consistent with SN SBOs, but we are unable to discard either the BNS or WD-IMBH TDE scenarios for these FXTs.
△ Less
Submitted 17 December, 2023;
originally announced December 2023.
-
Can LLM find the green circle? Investigation and Human-guided tool manipulation for compositional generalization
Authors:
Min Zhang,
Jianfeng He,
Shuo Lei,
Murong Yue,
Linhang Wang,
Chang-Tien Lu
Abstract:
The meaning of complex phrases in natural language is composed of their individual components. The task of compositional generalization evaluates a model's ability to understand new combinations of components. Previous studies trained smaller, task-specific models, which exhibited poor generalization. While large language models (LLMs) exhibit impressive generalization abilities on many tasks thro…
▽ More
The meaning of complex phrases in natural language is composed of their individual components. The task of compositional generalization evaluates a model's ability to understand new combinations of components. Previous studies trained smaller, task-specific models, which exhibited poor generalization. While large language models (LLMs) exhibit impressive generalization abilities on many tasks through in-context learning (ICL), their potential for compositional generalization remains unexplored. In this paper, we first empirically investigate prevailing ICL methods in compositional generalization. We find that they struggle with complex compositional questions due to cumulative errors in long reasoning steps and intricate logic required for tool-making. Consequently, we propose a human-guided tool manipulation framework (HTM) that generates tools for sub-questions and integrates multiple tools. Our method enhances the effectiveness of tool creation and usage with minimal human effort. Experiments show that our method achieves state-of-the-art performance on two compositional generalization benchmarks and outperforms existing methods on the most challenging test split by 70%.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
Integrating Communication, Sensing and Computing in Satellite Internet of Things: Challenges and Opportunities
Authors:
Yong Zuo,
Mingyang Yue,
Huiyuan Yang,
Liantao Wu,
Xiaojun Yuan
Abstract:
Satellite Internet of Things (IoT) is to use satellites as the access points for IoT devices to achieve the global coverage of future IoT systems, and is expected to support burgeoning IoT applications, including communication, sensing, and computing. However, the complex and dynamic satellite environments and limited network resources raise new challenges in the design of satellite IoT systems. I…
▽ More
Satellite Internet of Things (IoT) is to use satellites as the access points for IoT devices to achieve the global coverage of future IoT systems, and is expected to support burgeoning IoT applications, including communication, sensing, and computing. However, the complex and dynamic satellite environments and limited network resources raise new challenges in the design of satellite IoT systems. In this article, we focus on the joint design of communication, sensing, and computing to improve the performance of satellite IoT, which is quite different from the case of terrestrial IoT systems. We describe how the integration of the three functions can enhance system capabilities, and summarize the state-of-the-art solutions. Furthermore, we discuss the main challenges of integrating communication, sensing, and computing in satellite IoT to be solved with pressing interest.
△ Less
Submitted 3 December, 2023;
originally announced December 2023.
-
Coverage-Validity-Aware Algorithmic Recourse
Authors:
Ngoc Bui,
Duy Nguyen,
Man-Chung Yue,
Viet Anh Nguyen
Abstract:
Algorithmic recourse emerges as a prominent technique to promote the explainability, transparency and hence ethics of machine learning models. Existing algorithmic recourse approaches often assume an invariant predictive model; however, the predictive model is usually updated upon the arrival of new data. Thus, a recourse that is valid respective to the present model may become invalid for the fut…
▽ More
Algorithmic recourse emerges as a prominent technique to promote the explainability, transparency and hence ethics of machine learning models. Existing algorithmic recourse approaches often assume an invariant predictive model; however, the predictive model is usually updated upon the arrival of new data. Thus, a recourse that is valid respective to the present model may become invalid for the future model. To resolve this issue, we propose a novel framework to generate a model-agnostic recourse that exhibits robustness to model shifts. Our framework first builds a coverage-validity-aware linear surrogate of the nonlinear (black-box) model; then, the recourse is generated with respect to the linear surrogate. We establish a theoretical connection between our coverage-validity-aware linear surrogate and the minimax probability machines (MPM). We then prove that by prescribing different covariance robustness, the proposed framework recovers popular regularizations for MPM, including the $\ell_2$-regularization and class-reweighting. Furthermore, we show that our surrogate pushes the approximate hyperplane intuitively, facilitating not only robust but also interpretable recourses. The numerical results demonstrate the usefulness and robustness of our framework.
△ Less
Submitted 19 November, 2023;
originally announced November 2023.
-
Cooperative Multi-Agent Deep Reinforcement Learning for Adaptive Decentralized Emergency Voltage Control
Authors:
Ying Zhang,
Meng Yue
Abstract:
Under voltage load shedding (UVLS) for power grid emergency control builds the last defensive perimeter to prevent cascade outages and blackouts in case of contingencies. This letter proposes a novel cooperative multi-agent deep reinforcement learning (MADRL)-based UVLS algorithm in an adaptive decentralized way. With well-designed input signals reflecting the voltage deviation, newly structured n…
▽ More
Under voltage load shedding (UVLS) for power grid emergency control builds the last defensive perimeter to prevent cascade outages and blackouts in case of contingencies. This letter proposes a novel cooperative multi-agent deep reinforcement learning (MADRL)-based UVLS algorithm in an adaptive decentralized way. With well-designed input signals reflecting the voltage deviation, newly structured neural networks are developed as intelligent agents to obtain control actions and their probabilities to accommodate high uncertainties in volatile power system operations. Moreover, the interaction among the agents for coordinated control is implemented and refined by a state-of-the-art attention mechanism, which helps agents concentratively learn effective interacted information. The proposed method realizes decentralized coordinated control, adapting to extremely high uncertainties. Case studies on an IEEE benchmark system indicate the superior performance of the proposed algorithm.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
Large Language Model Cascades with Mixture of Thoughts Representations for Cost-efficient Reasoning
Authors:
Murong Yue,
Jie Zhao,
Min Zhang,
Liang Du,
Ziyu Yao
Abstract:
Large language models (LLMs) such as GPT-4 have exhibited remarkable performance in a variety of tasks, but this strong performance often comes with the high expense of using paid API services. In this paper, we are motivated to study building an LLM cascade to save the cost of using LLMs, particularly for performing reasoning (e.g., mathematical, causal) tasks. Our cascade pipeline follows the in…
▽ More
Large language models (LLMs) such as GPT-4 have exhibited remarkable performance in a variety of tasks, but this strong performance often comes with the high expense of using paid API services. In this paper, we are motivated to study building an LLM cascade to save the cost of using LLMs, particularly for performing reasoning (e.g., mathematical, causal) tasks. Our cascade pipeline follows the intuition that simpler questions can be addressed by a weaker but more affordable LLM, whereas only the challenging questions necessitate the stronger and more expensive LLM. To realize this decision-making, we consider the "answer consistency" of the weaker LLM as a signal of the question difficulty and propose several methods for the answer sampling and consistency checking, including one leveraging a mixture of two thought representations (i.e., Chain-of-Thought and Program-of-Thought). Through experiments on six reasoning benchmark datasets, with GPT-3.5-turbo and GPT-4 being the weaker and stronger LLMs, respectively, we demonstrate that our proposed LLM cascades can achieve performance comparable to using solely the stronger LLM but require only 40% of its cost.
△ Less
Submitted 8 February, 2024; v1 submitted 4 October, 2023;
originally announced October 2023.
-
EIGER V. Characterizing the Host Galaxies of Luminous Quasars at $z\gtrsim6$
Authors:
Minghao Yue,
Anna-Christina Eilers,
Robert A. Simcoe,
Ruari Mackenzie,
Jorryt Matthee,
Daichi Kashino,
Rongmon Bordoloi,
Simon J. Lilly,
Rohan P. Naidu
Abstract:
We report {\em JWST}/NIRCam measurements of quasar host galaxy emissions and supermassive black hole (SMBH) masses for six quasars at $5.9<z<7.1$ in the \textit{Emission-line galaxies and Intergalactic Gas in the Epoch of Reionization} (EIGER) project. We obtain deep NIRCam imaging in the F115W, F200W, and F356W bands, as well as F356W grism spectroscopy of the quasars. We use bright unsaturated s…
▽ More
We report {\em JWST}/NIRCam measurements of quasar host galaxy emissions and supermassive black hole (SMBH) masses for six quasars at $5.9<z<7.1$ in the \textit{Emission-line galaxies and Intergalactic Gas in the Epoch of Reionization} (EIGER) project. We obtain deep NIRCam imaging in the F115W, F200W, and F356W bands, as well as F356W grism spectroscopy of the quasars. We use bright unsaturated stars to construct models of the point spread function (PSF) and estimate the errors of these PSFs. We then measure or constrain the fluxes and morphology of the quasar host galaxies by fitting the quasar images as a point source plus an exponential disk. We successfully detect the host galaxy of three quasars, which have host-to-quasar flux ratios of $\sim1\%-5\%$. Spectral Energy Distribution (SED) fitting suggests that these quasar host galaxies have stellar masses of $M_*\gtrsim10^{10}M_\odot$. For quasars with host galaxy non-detections, we estimate the upper limits of their stellar masses. We use the grism spectra to measure the {\hb} line profile and the continuum luminosity, then estimate the SMBH masses for the quasars. Our results indicate that the positive relation between SMBH masses and host galaxy stellar masses already exists at redshift $z\gtrsim6$. The quasars in our sample show a high black hole to stellar mass ratio of $M_\text{BH}/M_*\sim0.15$, which is about $\sim2$ dex higher than local relations. We find that selection effects only contribute partially to the high $M_\text{BH}/M_*$ ratios of high-redshift quasars. This result hints at a possible redshift evolution of the $M_\text{BH}-M_*$ relation.
△ Less
Submitted 28 March, 2024; v1 submitted 8 September, 2023;
originally announced September 2023.
-
Numerical strategy on the grid orientation effect in the simulation for two-phase flow in porous media by using the adaptive artificial viscosity method
Authors:
Xiao-Hong Wang,
Meng-Chen Yue,
Zhi-Feng Liu,
Wei-Dong Cao,
Yong Wang,
Jun Hu,
Chang-Hao Xiao,
Yao-Yong Li
Abstract:
It is a challenge to numerically solve nonlinear partial differential equations whose solution involves discontinuity. In the context of numerical simulators for multi-phase flow in porous media, there exists a long-standing issue known as Grid Orientation Effect (GOE), wherein different numerical solutions can be obtained when considering grids with different orientations under certain unfavorabl…
▽ More
It is a challenge to numerically solve nonlinear partial differential equations whose solution involves discontinuity. In the context of numerical simulators for multi-phase flow in porous media, there exists a long-standing issue known as Grid Orientation Effect (GOE), wherein different numerical solutions can be obtained when considering grids with different orientations under certain unfavorable conditions. Our perspective is that GOE arises due to numerical instability near displacement fronts, where spurious oscillations accompanied by sharp fronts, if not adequately suppressed, lead to GOE. To reduce or even eliminate GOE, we propose augmenting adaptive artificial viscosity when solving the saturation equation. It has been demonstrated that appropriate artificial viscosity can effectively reduce or even eliminate GOE. The proposed numerical method can be easily applied in practical engineering problems.
△ Less
Submitted 13 August, 2023;
originally announced August 2023.
-
Gentopia: A Collaborative Platform for Tool-Augmented LLMs
Authors:
Binfeng Xu,
Xukun Liu,
Hua Shen,
Zeyu Han,
Yuhan Li,
Murong Yue,
Zhiyuan Peng,
Yuchen Liu,
Ziyu Yao,
Dongkuan Xu
Abstract:
Augmented Language Models (ALMs) empower large language models with the ability to use tools, transforming them into intelligent agents for real-world interactions. However, most existing frameworks for ALMs, to varying degrees, are deficient in the following critical features: flexible customization, collaborative democratization, and holistic evaluation. We present gentopia, an ALM framework ena…
▽ More
Augmented Language Models (ALMs) empower large language models with the ability to use tools, transforming them into intelligent agents for real-world interactions. However, most existing frameworks for ALMs, to varying degrees, are deficient in the following critical features: flexible customization, collaborative democratization, and holistic evaluation. We present gentopia, an ALM framework enabling flexible customization of agents through simple configurations, seamlessly integrating various language models, task formats, prompting modules, and plugins into a unified paradigm. Furthermore, we establish gentpool, a public platform enabling the registration and sharing of user-customized agents. Agents registered in gentpool are composable such that they can be assembled together for agent collaboration, advancing the democratization of artificial intelligence. To ensure high-quality agents, gentbench, an integral component of gentpool, is designed to thoroughly evaluate user-customized agents across diverse aspects such as safety, robustness, efficiency, etc. We release gentopia on Github and will continuously move forward.
△ Less
Submitted 8 August, 2023;
originally announced August 2023.
-
Toward Zero-shot Character Recognition: A Gold Standard Dataset with Radical-level Annotations
Authors:
Xiaolei Diao,
Daqian Shi,
Jian Li,
Lida Shi,
Mingzhe Yue,
Ruihua Qi,
Chuntao Li,
Hao Xu
Abstract:
Optical character recognition (OCR) methods have been applied to diverse tasks, e.g., street view text recognition and document analysis. Recently, zero-shot OCR has piqued the interest of the research community because it considers a practical OCR scenario with unbalanced data distribution. However, there is a lack of benchmarks for evaluating such zero-shot methods that apply a divide-and-conque…
▽ More
Optical character recognition (OCR) methods have been applied to diverse tasks, e.g., street view text recognition and document analysis. Recently, zero-shot OCR has piqued the interest of the research community because it considers a practical OCR scenario with unbalanced data distribution. However, there is a lack of benchmarks for evaluating such zero-shot methods that apply a divide-and-conquer recognition strategy by decomposing characters into radicals. Meanwhile, radical recognition, as another important OCR task, also lacks radical-level annotation for model training. In this paper, we construct an ancient Chinese character image dataset that contains both radical-level and character-level annotations to satisfy the requirements of the above-mentioned methods, namely, ACCID, where radical-level annotations include radical categories, radical locations, and structural relations. To increase the adaptability of ACCID, we propose a splicing-based synthetic character algorithm to augment the training samples and apply an image denoising method to improve the image quality. By introducing character decomposition and recombination, we propose a baseline method for zero-shot OCR. The experimental results demonstrate the validity of ACCID and the baseline model quantitatively and qualitatively.
△ Less
Submitted 1 August, 2023;
originally announced August 2023.
-
EIGER IV: The cool 10$^4$K circumgalactic environment of high-$z$ galaxies reveals remarkably efficient IGM enrichment
Authors:
Rongmon Bordoloi,
Robert A. Simcoe,
Jorryt Matthee,
Daichi Kashino,
Ruari Mackenzie,
Simon J. Lilly,
Anna-Christina Eilers,
Bin Liu,
David DePalma,
Minghao Yue,
Rohan P. Naidu
Abstract:
We report new observations of the cool diffuse gas around 29, $2.3<z<6.3$ galaxies, using deep JWST/NIRCam slitless grism spectroscopy around the sightline to the quasar J0100+2802. The galaxies span a stellar mass range of $7.1 \leq \log M_{*}/M_{sun} \leq 10.7$, and star-formation rates of $-0.1 < \log \; SFR/M_{sun}yr^{-1} \; <2.3$. We find galaxies for seven MgII absorption systems within 300…
▽ More
We report new observations of the cool diffuse gas around 29, $2.3<z<6.3$ galaxies, using deep JWST/NIRCam slitless grism spectroscopy around the sightline to the quasar J0100+2802. The galaxies span a stellar mass range of $7.1 \leq \log M_{*}/M_{sun} \leq 10.7$, and star-formation rates of $-0.1 < \log \; SFR/M_{sun}yr^{-1} \; <2.3$. We find galaxies for seven MgII absorption systems within 300 kpc of the quasar sightline. The MgII radial absorption profile falls off sharply with radii, with most of the absorption extending out to 2-3$R_{200}$ of the host galaxies. Six out of seven MgII absorption systems are detected around galaxies with $\log M_{*}/M_{sun} >$9. MgII absorption kinematics are shifted from the systemic redshift of host galaxies with a median absolute velocity of 135 km/s and standard deviation of 85 km/s. The high kinematic offset and large radial separation ($R> 1.3 R_{200}$), suggest that five out of the seven MgII absorption systems are gravitationally not bound to the galaxies. In contrast, most cool circumgalactic media at $z<1$ are gravitationally bound. The high incidence of unbound MgII gas in this work suggests that towards the end of reionization, galaxy halos are in a state of remarkable disequilibrium, and are highly efficient in enriching the intergalactic medium. Two strongest MgII absorption systems are detected at $z\sim$ 4.22 and 4.5, the former associated with a merging galaxy system and the latter associated with three kinematically close galaxies. Both these galaxies reside in local galaxy over-densities, indicating the presence of cool MgII absorption in two "proto-groups" at $z>4$.
△ Less
Submitted 8 January, 2024; v1 submitted 3 July, 2023;
originally announced July 2023.
-
Streamlined Lensed Quasar Identification in Multiband Images via Ensemble Networks
Authors:
Irham Taufik Andika,
Sherry H. Suyu,
Raoul Ca�ameras,
Alejandra Melo,
Stefan Schuldt,
Yiping Shu,
Anna-Christina Eilers,
Anton Timur Jaelani,
Minghao Yue
Abstract:
Quasars experiencing strong lensing offer unique viewpoints on subjects related to the cosmic expansion rate, the dark matter profile within the foreground deflectors, and the quasar host galaxies. Unfortunately, identifying them in astronomical images is challenging since they are overwhelmed by the abundance of non-lenses. To address this, we have developed a novel approach by ensembling cutting…
▽ More
Quasars experiencing strong lensing offer unique viewpoints on subjects related to the cosmic expansion rate, the dark matter profile within the foreground deflectors, and the quasar host galaxies. Unfortunately, identifying them in astronomical images is challenging since they are overwhelmed by the abundance of non-lenses. To address this, we have developed a novel approach by ensembling cutting-edge convolutional networks (CNNs) -- for instance, ResNet, Inception, NASNet, MobileNet, EfficientNet, and RegNet -- along with vision transformers (ViTs) trained on realistic galaxy-quasar lens simulations based on the Hyper Suprime-Cam (HSC) multiband images. While the individual model exhibits remarkable performance when evaluated against the test dataset, achieving an area under the receiver operating characteristic curve of $>$97.3% and a median false positive rate of 3.6%, it struggles to generalize in real data, indicated by numerous spurious sources picked by each classifier. A significant improvement is achieved by averaging these CNNs and ViTs, resulting in the impurities being downsized by factors up to 50. Subsequently, combining the HSC images with the UKIRT, VISTA, and unWISE data, we retrieve approximately 60 million sources as parent samples and reduce this to 892,609 after employing a photometry preselection to discover $z>1.5$ lensed quasars with Einstein radii of $θ_\mathrm{E}<5$ arcsec. Afterward, the ensemble classifier indicates 3080 sources with a high probability of being lenses, for which we visually inspect, yielding 210 prevailing candidates awaiting spectroscopic confirmation. These outcomes suggest that automated deep learning pipelines hold great potential in effectively detecting strong lenses in vast datasets with minimal manual visual inspection involved.
△ Less
Submitted 18 August, 2023; v1 submitted 3 July, 2023;
originally announced July 2023.
-
Little Red Dots: an abundant population of faint AGN at z~5 revealed by the EIGER and FRESCO JWST surveys
Authors:
Jorryt Matthee,
Rohan P. Naidu,
Gabriel Brammer,
John Chisholm,
Anna-Christina Eilers,
Andy Goulding,
Jenny Greene,
Daichi Kashino,
Ivo Labbe,
Simon J. Lilly,
Ruari Mackenzie,
Pascal A. Oesch,
Andrea Weibel,
Stijn Wuyts,
Mengyuan Xiao,
Rongmon Bordoloi,
Rychard Bouwens,
Pieter van Dokkum,
Garth Illingworth,
Ivan Kramarenko,
Michael V. Maseda,
Charlotte Mason,
Romain A. Meyer,
Erica J. Nelson,
Naveen A. Reddy
, et al. (3 additional authors not shown)
Abstract:
Characterising the prevalence and properties of faint active galactic nuclei (AGN) in the early Universe is key for understanding the formation of supermassive black holes (SMBHs) and determining their role in cosmic reionization. We perform a spectroscopic search for broad H$α$ emitters at $z\approx4-6$ using deep JWST/NIRCam imaging and wide field slitless spectroscopy from the EIGER and FRESCO…
▽ More
Characterising the prevalence and properties of faint active galactic nuclei (AGN) in the early Universe is key for understanding the formation of supermassive black holes (SMBHs) and determining their role in cosmic reionization. We perform a spectroscopic search for broad H$α$ emitters at $z\approx4-6$ using deep JWST/NIRCam imaging and wide field slitless spectroscopy from the EIGER and FRESCO surveys. We identify 20 H$α$ lines at $z=4.2-5.5$ that have broad components with line widths from $\sim1200-3700$ km s$^{-1}$, contributing $\sim30-90$ % of the total line flux. We interpret these broad components as being powered by accretion onto SMBHs with implied masses $\sim10^{7-8}$ M$_{\odot}$. In the UV luminosity range M$_{\rm UV}=-21$ to $-18$, we measure number densities of $\approx10^{-5}$ cMpc$^{-3}$. This is an order of magnitude higher than expected from extrapolating quasar UV luminosity functions. Yet, such AGN are found in only $<1$ % of star-forming galaxies at $z\sim5$. The SMBH mass function agrees with large cosmological simulations. In two objects we detect narrow red- and blue-shifted H$α$ absorption indicative, respectively, of dense gas fueling SMBH growth and outflows. We may be witnessing early AGN feedback that will clear dust-free pathways through which more massive blue quasars are seen. We uncover a strong correlation between reddening and the fraction of total galaxy luminosity arising from faint AGN. This implies that early SMBH growth is highly obscured and that faint AGN are only minor contributors to cosmic reionization.
△ Less
Submitted 1 February, 2024; v1 submitted 8 June, 2023;
originally announced June 2023.
-
The Sloan Digital Sky Survey Reverberation Mapping Project: Key Results
Authors:
Yue Shen,
Catherine J. Grier,
Keith Horne,
Zachary Stone,
Jennifer I. Li,
Qian Yang,
Yasaman Homayouni,
Jonathan R. Trump,
Scott F. Anderson,
W. N. Brandt,
Patrick B. Hall,
Luis C. Ho,
Linhua Jiang,
Patrick Petitjean,
Donald P. Schneider,
Charling Tao,
Fergus. R. Donnan,
Yusra AlSayyad,
Matthew A. Bershady,
Michael R. Blanton,
Dmitry Bizyaev,
Kevin Bundy,
Yuguang Chen,
Megan C. Davis,
Kyle Dawson
, et al. (22 additional authors not shown)
Abstract:
We present the final data from the Sloan Digital Sky Survey Reverberation Mapping (SDSS-RM) project, a precursor to the SDSS-V Black Hole Mapper Reverberation Mapping program. This data set includes 11-year photometric and 7-year spectroscopic light curves for 849 broad-line quasars over a redshift range of 0.1<z<4.5 and a luminosity range of Lbol=1E44-47.5 erg/s, along with spectral and variabili…
▽ More
We present the final data from the Sloan Digital Sky Survey Reverberation Mapping (SDSS-RM) project, a precursor to the SDSS-V Black Hole Mapper Reverberation Mapping program. This data set includes 11-year photometric and 7-year spectroscopic light curves for 849 broad-line quasars over a redshift range of 0.1<z<4.5 and a luminosity range of Lbol=1E44-47.5 erg/s, along with spectral and variability measurements. We report 23, 81, 125, and 110 reverberation mapping lags (relative to optical continuum variability) for broad Halpha, Hbeta, MgII and CIV using the SDSS-RM sample, spanning much of the luminosity and redshift ranges of the sample. Using 30 low-redshift RM AGNs with dynamical-modeling black hole masses, we derive a new estimate of the average virial factor of <log f>=0.62+-0.07 for the line dispersion measured from the RMS spectrum. The intrinsic scatter of individual virial factors is 0.31+-0.07 dex, indicating a factor of two systematic uncertainty in RM black hole masses. Our lag measurements reveal significant R-L relations for Hbeta and MgII at high redshift, consistent with the latest measurements based on heterogeneous samples. While we are unable to robustly constrain the slope of the R-L relation for CIV given the limited dynamical range in luminosity, we found substantially larger scatter in CIV lags at fixed L1350. Using the SDSS-RM lag sample, we derive improved single-epoch (SE) mass recipes for Hbeta, MgII and CIV, which are consistent with their respective RM masses as well as between the SE recipes from two different lines, over the luminosity range probed by our sample. The new Hbeta and MgII recipes are approximately unbiased estimators at given RM masses, but there are systematic biases in the CIV recipe. The intrinsic scatter of SE masses around RM masses is ~0.45 dex for Hbeta and MgII, increasing to ~0.58 dex for CIV.
△ Less
Submitted 1 April, 2024; v1 submitted 1 May, 2023;
originally announced May 2023.
-
A SPectroscopic survey of biased halos In the Reionization Era (ASPIRE): JWST Reveals a Filamentary Structure around a z=6.61 Quasar
Authors:
Feige Wang,
Jinyi Yang,
Joseph F. Hennawi,
Xiaohui Fan,
Fengwu Sun,
Jaclyn B. Champagne,
Tiago Costa,
Melanie Habouzit,
Ryan Endsley,
Zihao Li,
Xiaojing Lin,
Romain A. Meyer,
Jan-Torge Schindler,
Yunjing Wu,
Eduardo Bañados,
Aaron J. Barth,
Aklant K. Bhowmick,
Rebekka Bieri,
Laura Blecha,
Sarah Bosman,
Zheng Cai,
Luis Colina,
Thomas Connor,
Frederick B. Davies,
Roberto Decarli
, et al. (34 additional authors not shown)
Abstract:
We present the first results from the JWST ASPIRE program (A SPectroscopic survey of biased halos In the Reionization Era). This program represents an imaging and spectroscopic survey of 25 reionization-era quasars and their environments by utilizing the unprecedented capabilities of NIRCam Wide Field Slitless Spectroscopy (WFSS) mode. ASPIRE will deliver the largest ($\sim280~{\rm arcmin}^2$) gal…
▽ More
We present the first results from the JWST ASPIRE program (A SPectroscopic survey of biased halos In the Reionization Era). This program represents an imaging and spectroscopic survey of 25 reionization-era quasars and their environments by utilizing the unprecedented capabilities of NIRCam Wide Field Slitless Spectroscopy (WFSS) mode. ASPIRE will deliver the largest ($\sim280~{\rm arcmin}^2$) galaxy redshift survey at 3-4 $μ$m among JWST Cycle-1 programs and provide extensive legacy values for studying the formation of the earliest supermassive black holes (SMBHs), the assembly of galaxies, early metal enrichment, and cosmic reionization. In this first ASPIRE paper, we report the discovery of a filamentary structure traced by the luminous quasar J0305-3150 and ten [OIII] emitters at $z=6.6$. This structure has a 3D galaxy overdensity of $δ_{\rm gal}=12.6$ over 637 cMpc$^3$, one of the most overdense structures known in the early universe, and could eventually evolve into a massive galaxy cluster. Together with existing VLT/MUSE and ALMA observations of this field, our JWST observations reveal that J0305-3150 traces a complex environment where both UV-bright and dusty galaxies are present, and indicate that the early evolution of galaxies around the quasar is not simultaneous. In addition, we discovered 31 [OIII] emitters in this field at other redshifts, $5.3<z<6.7$, with half of them situated at $z\sim5.4$ and $z\sim6.2$. This indicates that star-forming galaxies, such as [OIII] emitters, are generally clustered at high redshifts. These discoveries demonstrate the unparalleled redshift survey capabilities of NIRCam WFSS and the potential of the full ASPIRE survey dataset.
△ Less
Submitted 19 April, 2023;
originally announced April 2023.
-
A SPectroscopic survey of biased halos In the Reionization Era (ASPIRE): A First Look at the Rest-frame Optical Spectra of $z > 6.5$ Quasars Using JWST
Authors:
Jinyi Yang,
Feige Wang,
Xiaohui Fan,
Joseph F. Hennawi,
Aaron J. Barth,
Eduardo Bañados,
Fengwu Sun,
Weizhe Liu,
Zheng Cai,
Linhua Jiang,
Zihao Li,
Masafusa Onoue,
Jan-Torge Schindler,
Yue Shen,
Yunjing Wu,
Aklant K. Bhowmick,
Rebekka Bieri,
Laura Blecha,
Sarah Bosman,
Jaclyn B. Champagne,
Luis Colina,
Thomas Connor,
Tiago Costa,
Frederick B. Davies,
Roberto Decarli
, et al. (31 additional authors not shown)
Abstract:
Studies of rest-frame optical emission in quasars at $z>6$ have historically been limited by the wavelengths accessible by ground-based telescopes. The James Webb Space Telescope (JWST) now offers the opportunity to probe this emission deep into the reionization epoch. We report the observations of eight quasars at $z>6.5$ using the JWST/NIRCam Wide Field Slitless Spectroscopy, as a part of the ''…
▽ More
Studies of rest-frame optical emission in quasars at $z>6$ have historically been limited by the wavelengths accessible by ground-based telescopes. The James Webb Space Telescope (JWST) now offers the opportunity to probe this emission deep into the reionization epoch. We report the observations of eight quasars at $z>6.5$ using the JWST/NIRCam Wide Field Slitless Spectroscopy, as a part of the ''A SPectroscopic survey of biased halos In the Reionization Era (ASPIRE)" program. Our JWST spectra cover the quasars' emission between rest frame $\sim$ 4100 and 5100 Å. The profiles of these quasars' broad H$β$ emission lines span a FWHM from 3000 to 6000 $\rm{km~s^{-1}}$. The H$β$-based virial black hole (BH) masses, ranging from 0.6 to 2.1 billion solar masses, are generally consistent with their MgII-based BH masses. The new measurements based on the more reliable H$β$ tracer thus confirm the existence of billion solar-mass BHs in the reionization epoch. In the observed [OIII] $λλ$4960,5008 doublets of these luminous quasars, broad components are more common than narrow core components ($\le~1200~\rm{km~s^{-1}}$), and only one quasar shows stronger narrow components than broad. Two quasars exhibit significantly broad and blueshifted [OIII] emission, thought to trace galactic-scale outflows, with median velocities of $-610~\rm{km~s^{-1}}$ and $-1430~\rm{km~s^{-1}}$ relative to the [CII] $158\,μ$m line. All eight quasars show strong optical FeII emission, and follow the Eigenvector 1 relations defined by low-redshift quasars. The entire ASPIRE program will eventually cover 25 quasars and provide a statistical sample for the studies of the BHs and quasar spectral properties.
△ Less
Submitted 19 April, 2023;
originally announced April 2023.
-
Detecting and Characterizing Young Quasars. III. The Impact of Gravitational Lensing Magnification
Authors:
Minghao Yue,
Anna-Christina Eilers,
Robert A. Simcoe,
Sirio Belli,
Frederick B. Davies,
David DePalma,
Joseph F. Hennawi,
Charlotte A. Mason,
Julian B. Mu�oz,
Erica J. Nelson,
Sandro Tacchella
Abstract:
We test the impact of gravitational lensing on the lifetime estimates of seven high-redshift quasars at redshift $z\gtrsim6$. The targeted quasars are identified by their small observed proximity zone sizes, which indicate extremely short quasar lifetimes $(t_Q\lesssim10^5 \text{ yrs})$. However, these estimates of quasar lifetimes rely on the assumption that the observed luminosities of the quasa…
▽ More
We test the impact of gravitational lensing on the lifetime estimates of seven high-redshift quasars at redshift $z\gtrsim6$. The targeted quasars are identified by their small observed proximity zone sizes, which indicate extremely short quasar lifetimes $(t_Q\lesssim10^5 \text{ yrs})$. However, these estimates of quasar lifetimes rely on the assumption that the observed luminosities of the quasars are intrinsic and not magnified by gravitational lensing, which would bias the lifetime estimates towards younger ages. In order to test possible effects of gravitational lensing, we obtain high-resolution images of the seven quasars with the {\em Hubble Space Telescope (HST)} and look for signs of strong lensing. We do not find any evidence of strong lensing, i.e., all quasars are well-described by point sources, and no foreground lensing galaxy is detected. We estimate that the strong lensing probabilities for these quasars are extremely small $(\sim1.4\times10^{-5})$, and show that weak lensing changes the estimated quasar lifetimes by only $\lesssim0.2$ dex. We thus confirm that the short lifetimes of these quasars are intrinsic. The existence of young quasars indicates a high obscured fraction, radiatively inefficient accretion, and/or flickering light curves for high-redshift quasars. We further discuss the impact of lensing magnification on measurements of black hole masses and Eddington ratios of quasars.
△ Less
Submitted 18 April, 2023;
originally announced April 2023.
-
A Survey for High-redshift Gravitationally Lensed Quasars and Close Quasars Pairs. I. the Discoveries of an Intermediately-lensed Quasar and a Kpc-scale Quasar Pair at $z\sim5$
Authors:
Minghao Yue,
Xiaohui Fan,
Jinyi Yang,
Feige Wang
Abstract:
We present the first results from a new survey for high-redshift $(z\gtrsim5)$ gravitationally lensed quasars and close quasar pairs. We carry out candidate selection based on the colors and shapes of objects in public imaging surveys, then conduct follow-up observations to confirm the nature of high-priority candidates. In this paper, we report the discoveries of J0025--0145 ($z=5.07$) which we i…
▽ More
We present the first results from a new survey for high-redshift $(z\gtrsim5)$ gravitationally lensed quasars and close quasar pairs. We carry out candidate selection based on the colors and shapes of objects in public imaging surveys, then conduct follow-up observations to confirm the nature of high-priority candidates. In this paper, we report the discoveries of J0025--0145 ($z=5.07$) which we identify as an {intermediately-lensed quasar, and J2329--0522 ($z=4.85$) which is a kpc-scale close quasar pair. The {\em Hubble Space Telescope (HST)} image of J0025--0145 shows a foreground lensing galaxy located $0\farcs6$ away from the quasar. However, J0025--0145 does not exhibit multiple lensed images of the quasar, and we identify J0025--0145 as an intermediate lensing system (a lensing system that is not multiply imaged but has a significant magnification). The spectrum of J0025--0145 implies an extreme Eddington ratio if the quasar luminosity is intrinsic, which could be explained by a large lensing magnification. The {\em HST} image of J0025--0145 also indicates a tentative detection of the quasar host galaxy in rest-frame UV, illustrating the power of lensing magnification and distortion in studies of high-redshift quasar host galaxies. J2329--0522 consists of two resolved components with significantly different spectral properties, and a lack of lensing galaxy detection under sub-arcsecond seeing. We identify it as a close quasar pair, which is the highest confirmed kpc-scale quasar pair to date. We also report four lensed quasars and quasar pairs at $2<z<4$, and discuss possible improvements to our survey strategy.
△ Less
Submitted 7 March, 2023;
originally announced March 2023.
-
Federated attention consistent learning models for prostate cancer diagnosis and Gleason grading
Authors:
Fei Kong,
Xiyue Wang,
Jinxi Xiang,
Sen Yang,
Xinran Wang,
Meng Yue,
Jun Zhang,
Junhan Zhao,
Xiao Han,
Yuhan Dong,
Biyue Zhu,
Fang Wang,
Yueping Liu
Abstract:
Artificial intelligence (AI) holds significant promise in transforming medical imaging, enhancing diagnostics, and refining treatment strategies. However, the reliance on extensive multicenter datasets for training AI models poses challenges due to privacy concerns. Federated learning provides a solution by facilitating collaborative model training across multiple centers without sharing raw data.…
▽ More
Artificial intelligence (AI) holds significant promise in transforming medical imaging, enhancing diagnostics, and refining treatment strategies. However, the reliance on extensive multicenter datasets for training AI models poses challenges due to privacy concerns. Federated learning provides a solution by facilitating collaborative model training across multiple centers without sharing raw data. This study introduces a federated attention-consistent learning (FACL) framework to address challenges associated with large-scale pathological images and data heterogeneity. FACL enhances model generalization by maximizing attention consistency between local clients and the server model. To ensure privacy and validate robustness, we incorporated differential privacy by introducing noise during parameter transfer. We assessed the effectiveness of FACL in cancer diagnosis and Gleason grading tasks using 19,461 whole-slide images of prostate cancer from multiple centers. In the diagnosis task, FACL achieved an area under the curve (AUC) of 0.9718, outperforming seven centers with an average AUC of 0.9499 when categories are relatively balanced. For the Gleason grading task, FACL attained a Kappa score of 0.8463, surpassing the average Kappa score of 0.7379 from six centers. In conclusion, FACL offers a robust, accurate, and cost-effective AI training model for prostate cancer pathology while maintaining effective data safeguards.
△ Less
Submitted 28 March, 2024; v1 submitted 12 February, 2023;
originally announced February 2023.
-
On Approximating the Dynamic Response of Synchronous Generators via Operator Learning: A Step Towards Building Deep Operator-based Power Grid Simulators
Authors:
Christian Moya,
Guang Lin,
Tianqiao Zhao,
Meng Yue
Abstract:
This paper designs an Operator Learning framework to approximate the dynamic response of synchronous generators. One can use such a framework to (i) design a neural-based generator model that can interact with a numerical simulator of the rest of the power grid or (ii) shadow the generator's transient response. To this end, we design a data-driven Deep Operator Network~(DeepONet) that approximates…
▽ More
This paper designs an Operator Learning framework to approximate the dynamic response of synchronous generators. One can use such a framework to (i) design a neural-based generator model that can interact with a numerical simulator of the rest of the power grid or (ii) shadow the generator's transient response. To this end, we design a data-driven Deep Operator Network~(DeepONet) that approximates the generators' infinite-dimensional solution operator. Then, we develop a DeepONet-based numerical scheme to simulate a given generator's dynamic response over a short/medium-term horizon. The proposed numerical scheme recursively employs the trained DeepONet to simulate the response for a given multi-dimensional input, which describes the interaction between the generator and the rest of the system. Furthermore, we develop a residual DeepONet numerical scheme that incorporates information from mathematical models of synchronous generators. We accompany this residual DeepONet scheme with an estimate for the prediction's cumulative error. We also design a data aggregation (DAgger) strategy that allows (i) employing supervised learning to train the proposed DeepONets and (ii) fine-tuning the DeepONet using aggregated training data that the DeepONet is likely to encounter during interactive simulations with other grid components. Finally, as a proof of concept, we demonstrate that the proposed DeepONet frameworks can effectively approximate the transient model of a synchronous generator.
△ Less
Submitted 29 January, 2023;
originally announced January 2023.
-
Prethermal time-crystalline spin ice and monopole confinement in a driven magnet
Authors:
Mingxi Yue,
Zi Cai
Abstract:
Studies on systems far from equilibrium open up new avenues for investigating exotic phases of matter. A driven-dissipative frustrated spin system is examined in this study, and we suggest an out-of-equilibrium non-magnetic phase where the spins do not order but adhere to the ice rule in space and establish a long-range crystalline order in time. In contrast to the conventional spin ice, the dynam…
▽ More
Studies on systems far from equilibrium open up new avenues for investigating exotic phases of matter. A driven-dissipative frustrated spin system is examined in this study, and we suggest an out-of-equilibrium non-magnetic phase where the spins do not order but adhere to the ice rule in space and establish a long-range crystalline order in time. In contrast to the conventional spin ice, the dynamics of monopoles is confined due to the nonequilibrium feature of our model. Possible experimental realizations of our model has been discussed.
△ Less
Submitted 18 July, 2023; v1 submitted 8 January, 2023;
originally announced January 2023.
-
EIGER III. JWST/NIRCam observations of the ultra-luminous high-redshift quasar J0100+2802
Authors:
Anna-Christina Eilers,
Robert A. Simcoe,
Minghao Yue,
Ruari Mackenzie,
Jorryt Matthee,
Dominika Durovcikova,
Daichi Kashino,
Rongmon Bordoloi,
Simon J. Lilly
Abstract:
We present the first rest-frame optical spectrum of a high-redshift quasar observed with JWST/NIRCam in Wide Field Slitless (WFSS) mode. The observed quasar, J0100+2802, is the most luminous quasar known at $z>6$. We measure the mass of the central supermassive black hole (SMBH) by means of the rest-frame optical H$β$ emission line, and find consistent mass measurements of the quasar's SMBH of…
▽ More
We present the first rest-frame optical spectrum of a high-redshift quasar observed with JWST/NIRCam in Wide Field Slitless (WFSS) mode. The observed quasar, J0100+2802, is the most luminous quasar known at $z>6$. We measure the mass of the central supermassive black hole (SMBH) by means of the rest-frame optical H$β$ emission line, and find consistent mass measurements of the quasar's SMBH of $M_\bullet\approx10^{10}\,M_\odot$ when compared to the estimates based on the properties of rest-frame UV emission lines CIV and MgII, which are accessible from ground-based observatories. To this end, we also present a newly reduced rest-frame UV spectrum of the quasar observed with X-Shooter/VLT and FIRE/Magellan for a total of 16.8 hours. We readdress the question whether this ultra-luminous quasar could be effected by strong gravitational lensing making use of the diffraction limited NIRCam images in three different wide band filters (F115W, F200W, F356W), which improves the achieved spatial resolution compared to previous images taken with the Hubble Space Telescope by a factor of two. We do not find any evidence for a foreground deflecting galaxy, nor for multiple images of the quasar, and determine the probability for magnification due to strong gravitational lensing with image separations below the diffraction limit of $Δθ\lesssim 0.05''$ to be $\lesssim 2.2\times 10^{-3}$. Our observations therefore confirm that this quasar hosts a ten billion solar mass black hole less than $1$ Gyr after the Big Bang, which is challenging to explain with current black hole formation models.
△ Less
Submitted 18 May, 2023; v1 submitted 29 November, 2022;
originally announced November 2022.
-
OFDM-Based Massive Connectivity for LEO Satellite Internet of Things
Authors:
Yong Zuo,
Mingyang Yue,
Mingchen Zhang,
Sixian Li,
Shaojie Ni,
Xiaojun Yuan
Abstract:
Low earth orbit (LEO) satellite has been considered as a potential supplement for the terrestrial Internet of Things (IoT). In this paper, we consider grant-free non-orthogonal random access (GF-NORA) in orthogonal frequency division multiplexing (OFDM) system to increase access capacity and reduce access latency for LEO satellite-IoT. We focus on the joint device activity detection (DAD) and chan…
▽ More
Low earth orbit (LEO) satellite has been considered as a potential supplement for the terrestrial Internet of Things (IoT). In this paper, we consider grant-free non-orthogonal random access (GF-NORA) in orthogonal frequency division multiplexing (OFDM) system to increase access capacity and reduce access latency for LEO satellite-IoT. We focus on the joint device activity detection (DAD) and channel estimation (CE) problem at the satellite access point. The delay and the Doppler effect of the LEO satellite channel are assumed to be partially compensated. We propose an OFDM-symbol repetition technique to better distinguish the residual Doppler frequency shifts, and present a grid-based parametric probability model to characterize channel sparsity in the delay-Doppler-user domain, as well as to characterize the relationship between the channel states and the device activity. Based on that, we develop a robust Bayesian message passing algorithm named modified variance state propagation (MVSP) for joint DAD and CE. Moreover, to tackle the mismatch between the real channel and its on-grid representation, an expectation-maximization (EM) framework is proposed to learn the grid parameters. Simulation results demonstrate that our proposed algorithms significantly outperform the existing approaches in both activity detection probability and channel estimation accuracy.
△ Less
Submitted 31 October, 2022;
originally announced October 2022.
-
Space-time symmetry breaking in nonequilibrium frustrated magnetism
Authors:
Mingxi Yue,
Zi Cai
Abstract:
Spontaneous symmetry breaking is responsible for the rich phenomena in equilibrium physics. Driving a system out-of-equilibrium can significantly enrich the possibility of spontaneous symmetry breaking, which occurs not only in space, but also in time domain. This study investigates a driven-dissipative frustrated magnetic system. Results show that frustration in such a far-from-equilibrium system…
▽ More
Spontaneous symmetry breaking is responsible for the rich phenomena in equilibrium physics. Driving a system out-of-equilibrium can significantly enrich the possibility of spontaneous symmetry breaking, which occurs not only in space, but also in time domain. This study investigates a driven-dissipative frustrated magnetic system. Results show that frustration in such a far-from-equilibrium system could lead to a wealth of intriguing non-equilibrium phases with intertwined space-time symmetry breaking, (e.g.) a discrete time crystal phase accompanied by a time-dependent spatial order oscillating between a long-range tripartite stripe and a short-range ferromagnetic order.
△ Less
Submitted 8 January, 2023; v1 submitted 29 September, 2022;
originally announced September 2022.
-
Approximate Secular Equations for the Cubic Regularization Subproblem
Authors:
Yihang Gao,
Man-Chung Yue,
Michael K. Ng
Abstract:
The cubic regularization method (CR) is a popular algorithm for unconstrained non-convex optimization. At each iteration, CR solves a cubically regularized quadratic problem, called the cubic regularization subproblem (CRS). One way to solve the CRS relies on solving the secular equation, whose computational bottleneck lies in the computation of all eigenvalues of the Hessian matrix. In this paper…
▽ More
The cubic regularization method (CR) is a popular algorithm for unconstrained non-convex optimization. At each iteration, CR solves a cubically regularized quadratic problem, called the cubic regularization subproblem (CRS). One way to solve the CRS relies on solving the secular equation, whose computational bottleneck lies in the computation of all eigenvalues of the Hessian matrix. In this paper, we propose and analyze a novel CRS solver based on an approximate secular equation, which requires only some of the Hessian eigenvalues and is therefore much more efficient. Two approximate secular equations (ASEs) are developed. For both ASEs, we first study the existence and uniqueness of their roots and then establish an upper bound on the gap between the root and that of the standard secular equation. Such an upper bound can in turn be used to bound the distance from the approximate CRS solution based ASEs to the true CRS solution, thus offering a theoretical guarantee for our CRS solver. A desirable feature of our CRS solver is that it requires only matrix-vector multiplication but not matrix inversion, which makes it particularly suitable for high-dimensional applications of unconstrained non-convex optimization, such as low-rank recovery and deep learning. Numerical experiments with synthetic and real data-sets are conducted to investigate the practical performance of the proposed CRS solver. Experimental results show that the proposed solver outperforms two state-of-the-art methods.
△ Less
Submitted 27 September, 2022;
originally announced September 2022.
-
DeepGraphONet: A Deep Graph Operator Network to Learn and Zero-shot Transfer the Dynamic Response of Networked Systems
Authors:
Yixuan Sun,
Christian Moya,
Guang Lin,
Meng Yue
Abstract:
This paper develops a Deep Graph Operator Network (DeepGraphONet) framework that learns to approximate the dynamics of a complex system (e.g. the power grid or traffic) with an underlying sub-graph structure. We build our DeepGraphONet by fusing the ability of (i) Graph Neural Networks (GNN) to exploit spatially correlated graph information and (ii) Deep Operator Networks~(DeepONet) to approximate…
▽ More
This paper develops a Deep Graph Operator Network (DeepGraphONet) framework that learns to approximate the dynamics of a complex system (e.g. the power grid or traffic) with an underlying sub-graph structure. We build our DeepGraphONet by fusing the ability of (i) Graph Neural Networks (GNN) to exploit spatially correlated graph information and (ii) Deep Operator Networks~(DeepONet) to approximate the solution operator of dynamical systems. The resulting DeepGraphONet can then predict the dynamics within a given short/medium-term time horizon by observing a finite history of the graph state information. Furthermore, we design our DeepGraphONet to be resolution-independent. That is, we do not require the finite history to be collected at the exact/same resolution. In addition, to disseminate the results from a trained DeepGraphONet, we design a zero-shot learning strategy that enables using it on a different sub-graph. Finally, empirical results on the (i) transient stability prediction problem of power grids and (ii) traffic flow forecasting problem of a vehicular system illustrate the effectiveness of the proposed DeepGraphONet.
△ Less
Submitted 21 September, 2022;
originally announced September 2022.
-
RIS-Aided Multiuser MIMO-OFDM with Linear Precoding and Iterative Detection: Analysis and Optimization
Authors:
Mingyang Yue,
Lei Liu,
Xiaojun Yuan
Abstract:
In this paper, we consider a reconfigurable intelligence surface (RIS) aided uplink multiuser multi-input multi-output (MIMO) orthogonal frequency division multiplexing (OFDM) system, where the receiver is assumed to conduct low-complexity iterative detection. We aim to minimize the total transmit power by jointly designing the precoder of the transmitter and the passive beamforming of the RIS. Th…
▽ More
In this paper, we consider a reconfigurable intelligence surface (RIS) aided uplink multiuser multi-input multi-output (MIMO) orthogonal frequency division multiplexing (OFDM) system, where the receiver is assumed to conduct low-complexity iterative detection. We aim to minimize the total transmit power by jointly designing the precoder of the transmitter and the passive beamforming of the RIS. This problem can be tackled from the perspective of information theory. But this information-theoretic approach may involve prohibitively high complexity since the number of rate constraints that specify the capacity region of the uplink multiuser channel is exponential in the number of users. To avoid this difficulty, we formulate the design problem of the iterative receiver under the constraints of a maximal iteration number and target bit error rates of users. To tackle this challenging problem, we propose a groupwise successive interference cancellation (SIC) optimization approach, where the signals of users are decoded and cancelled in a group-by-group manner. We present a heuristic user grouping strategy, and resort to the alternating optimization technique to iteratively solve the precoding and passive beamforming sub-problems. Specifically, for the precoding sub-problem, we employ fractional programming to convert it to a convex problem; for the passive beamforming sub-problem, we adopt successive convex approximation to deal with the unit-modulus constraints of the RIS. We show that the proposed groupwise SIC approach has significant advantages in both performance and computational complexity, as compared with the counterpart approaches.
△ Less
Submitted 30 August, 2022;
originally announced August 2022.