Showing 1–50 of 146 results for author: Hassan, S

Searching in archive cs.
  1. arXiv:2410.13641  [pdf, other]

    cs.CL

    An Active Learning Framework for Inclusive Generation by Large Language Models

    Authors: Sabit Hassan, Anthony Sicilia, Malihe Alikhani

    Abstract: Ensuring that Large Language Models (LLMs) generate text representative of diverse sub-populations is essential, particularly when key concepts related to under-represented groups are scarce in the training data. We address this challenge with a novel clustering-based active learning framework, enhanced with knowledge distillation. The proposed framework transforms the intermediate outputs of the… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  2. arXiv:2410.11114  [pdf, other]

    cs.CL

    Active Learning for Robust and Representative LLM Generation in Safety-Critical Scenarios

    Authors: Sabit Hassan, Anthony Sicilia, Malihe Alikhani

    Abstract: Ensuring robust safety measures across a wide range of scenarios is crucial for user-facing systems. While Large Language Models (LLMs) can generate valuable data for safety measures, they often exhibit distributional biases, focusing on common scenarios and neglecting rare but critical cases. This can undermine the effectiveness of safety protocols developed using such data. To address this, we p… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  3. arXiv:2409.18718  [pdf, other]

    cs.NI cs.LG

    Enhancing Spectrum Efficiency in 6G Satellite Networks: A GAIL-Powered Policy Learning via Asynchronous Federated Inverse Reinforcement Learning

    Authors: Sheikh Salman Hassan, Yu Min Park, Yan Kyaw Tun, Walid Saad, Zhu Han, Choong Seon Hong

    Abstract: In this paper, a novel generative adversarial imitation learning (GAIL)-powered policy learning approach is proposed for optimizing beamforming, spectrum allocation, and remote user equipment (RUE) association in NTNs. Traditional reinforcement learning (RL) methods for wireless network optimization often rely on manually designed reward functions, which can require extensive parameter tuning. To… ▽ More

    Submitted 27 September, 2024; originally announced September 2024.

    Comments: Submitted to IEEE Transactions on Mobile Computing (16 pages, 10 figures)

  4. Multi-modal Medical Image Fusion For Non-Small Cell Lung Cancer Classification

    Authors: Salma Hassan, Hamad Al Hammadi, Ibrahim Mohammed, Muhammad Haris Khan

    Abstract: The early detection and nuanced subtype classification of non-small cell lung cancer (NSCLC), a predominant cause of cancer mortality worldwide, is a critical and complex issue. In this paper, we introduce an innovative integration of multi-modal data, synthesizing fused medical imaging (CT and PET scans) with clinical health records and genomic data. This unique fusion methodology leverages advan… ▽ More

    Submitted 27 September, 2024; originally announced September 2024.

  5. arXiv:2409.15724  [pdf, other]

    cs.SE cs.AI cs.IR

    LLM-Cure: LLM-based Competitor User Review Analysis for Feature Enhancement

    Authors: Maram Assi, Safwat Hassan, Ying Zou

    Abstract: The exponential growth of the mobile app market underscores the importance of constant innovation and rapid response to user demands. As user satisfaction is paramount to the success of a mobile application (app), developers typically rely on user reviews, which represent user feedback that includes ratings and comments to identify areas for improvement. However, the sheer volume of user reviews p… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: 25 pages

  6. arXiv:2409.14726  [pdf, other]

    cs.ET eess.SP

    Semantic Communication Enabled 6G-NTN Framework: A Novel Denoising and Gateway Hop Integration Mechanism

    Authors: Loc X. Nguyen, Sheikh Salman Hassan, Yan Kyaw Tun, Kitae Kim, Zhu Han, Choong Seon Hong

    Abstract: The sixth-generation (6G) non-terrestrial networks (NTNs) are crucial for real-time monitoring in critical applications like disaster relief. However, limited bandwidth, latency, rain attenuation, long propagation delays, and co-channel interference pose challenges to efficient satellite communication. Therefore, semantic communication (SC) has emerged as a promising solution to improve transmissi… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

    Comments: 13 pages, 8 figures, 2 tables

  7. arXiv:2408.07997  [pdf, other]

    quant-ph cs.ET

    Enhanced Quantum Energy Teleportation using a 3-Qubit System

    Authors: Md Shoyib Hassan, Syed Emad Uddin Shubha, M. R. C Mahdy

    Abstract: Quantum Energy Teleportation (QET) is a novel method that leverages quantum entanglement to transfer energy between two distant locations without any physical movement of the energy. The first realization of QET on superconducting hardware, utilizing a 2-qubit system, demonstrated an average energy retrieval efficiency of 35.4% (observing only V) by the receiver, Bob. In this paper, we present a… ▽ More

    Submitted 14 October, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

    Comments: 13 pages, 13 figures, 2 tables, 50+ equations

  8. arXiv:2408.03959  [pdf, other]

    cs.NI eess.SP

    Semantic Enabled 6G LEO Satellite Communication for Earth Observation: A Resource-Constrained Network Optimization

    Authors: Sheikh Salman Hassan, Loc X. Nguyen, Yan Kyaw Tun, Zhu Han, Choong Seon Hong

    Abstract: Earth observation satellites generate large amounts of real-time data for monitoring and managing time-critical events such as disaster relief missions. This presents a major challenge for satellite-to-ground communications operating under limited bandwidth capacities. This paper explores semantic communication (SC) as a potential alternative to traditional communication methods. The rationality f… ▽ More

    Submitted 31 July, 2024; originally announced August 2024.

    Comments: Accepted in GLOBECOM 2024

  9. arXiv:2407.15806  [pdf, other]

    cs.CV cs.CL

    FSboard: Over 3 million characters of ASL fingerspelling collected via smartphones

    Authors: Manfred Georg, Garrett Tanzer, Saad Hassan, Maximus Shengelia, Esha Uboweja, Sam Sepah, Sean Forbes, Thad Starner

    Abstract: Progress in machine understanding of sign languages has been slow and hampered by limited data. In this paper, we present FSboard, an American Sign Language fingerspelling dataset situated in a mobile text entry use case, collected from 147 paid and consenting Deaf signers using Pixel 4A selfie cameras in a variety of environments. Fingerspelling recognition is an incomplete solution that is only… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: Access FSboard at https://www.kaggle.com/datasets/googleai/fsboard

  10. arXiv:2407.12941  [pdf, other]

    cs.RO

    Robotic Arm Manipulation with Inverse Reinforcement Learning & TD-MPC

    Authors: Md Shoyib Hassan, Sabir Md Sanaullah

    Abstract: One unresolved issue is how to scale model-based inverse reinforcement learning (IRL) to actual robotic manipulation tasks with unpredictable dynamics. The ability to learn from both visual and proprioceptive examples, creating algorithms that scale to high-dimensional state-spaces, and mastering strong dynamics models are the main obstacles. In this work, we provide a gradient-based inverse reinf… ▽ More

    Submitted 7 August, 2024; v1 submitted 17 July, 2024; originally announced July 2024.

    Comments: 10 pages, 13 figures

    ACM Class: I.2.9

  11. arXiv:2407.04542  [pdf, other]

    cs.NI cs.CV cs.LG eess.IV

    Rethinking Image Compression on the Web with Generative AI

    Authors: Shayan Ali Hassan, Danish Humair, Ihsan Ayyub Qazi, Zafar Ayyub Qazi

    Abstract: The rapid growth of the Internet, driven by social media, web browsing, and video streaming, has made images central to the Web experience, resulting in significant data transfer and increased webpage sizes. Traditional image compression methods, while reducing bandwidth, often degrade image quality. This paper explores a novel approach using generative AI to reconstruct images at the edge or clie… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  12. arXiv:2406.13280  [pdf, other]

    cs.NI cs.AI

    Design Optimization of NOMA Aided Multi-STAR-RIS for Indoor Environments: A Convex Approximation Imitated Reinforcement Learning Approach

    Authors: Yu Min Park, Sheikh Salman Hassan, Yan Kyaw Tun, Eui-Nam Huh, Walid Saad, Choong Seon Hong

    Abstract: Non-orthogonal multiple access (NOMA) enables multiple users to share the same frequency band, and simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) provides 360-degree full-space coverage, optimizing both transmission and reflection for improved network performance and dynamic control of the indoor environment. However, deploying STAR-RIS indoors presents ch… ▽ More

    Submitted 17 September, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: 37 pages, 11 figures. arXiv admin note: text overlap with arXiv:2311.08708

  13. arXiv:2406.03773  [pdf, other]

    cs.IT

    Optimizing Multi-User Semantic Communication via Transfer Learning and Knowledge Distillation

    Authors: Loc X. Nguyen, Kitae Kim, Ye Lin Tun, Sheikh Salman Hassan, Yan Kyaw Tun, Zhu Han, Choong Seon Hong

    Abstract: Semantic communication, notable for ensuring quality of service by jointly optimizing source and channel coding, effectively extracts data semantics, reduces transmission length, and mitigates channel noise. However, most studies overlook multi-user scenarios and resource availability, limiting real-world application. This paper addresses this gap by focusing on downlink communication from a base… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 5 pages, 5 figures

  14. arXiv:2405.07189  [pdf]

    cs.NI

    A hybrid meta-heuristic approach for channel estimation in OFDM MIMO

    Authors: Shahriar Hassan, Umme Farhana, Md Karam Newaz

    Abstract: In wireless communication, Multiple Input Multiple Output (MIMO) technology has brought significant improvement in service by adopting Orthogonal Frequency Division Multiplexing (OFDM), a digital modulation technique. To achieve great performance with MIMO, efficiently gathering channel state information (CSI) plays a vital role. Among the different channel estimation techniques, data-aided c… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Journal ref: Journal of Gono Bishwabidyalay, Vol. 4, Issue. 1, PP. 224-236, 2023

  15. arXiv:2404.17046  [pdf, other]

    cs.SE cs.AI

    Unraveling Code Clone Dynamics in Deep Learning Frameworks

    Authors: Maram Assi, Safwat Hassan, Ying Zou

    Abstract: Deep Learning (DL) frameworks play a critical role in advancing artificial intelligence, and their rapid growth underscores the need for a comprehensive understanding of software quality and maintainability. DL frameworks, like other systems, are prone to code clones. Code clones refer to identical or highly similar source code fragments within the same project or even across different projects. C… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 37 pages

  16. arXiv:2404.16208  [pdf, other]

    cs.ET cs.AI

    GPU-RANC: A CUDA Accelerated Simulation Framework for Neuromorphic Architectures

    Authors: Sahil Hassan, Michael Inouye, Miguel C. Gonzalez, Ilkin Aliyev, Joshua Mack, Maisha Hafiz, Ali Akoglu

    Abstract: Open-source simulation tools play a crucial role for neuromorphic application engineers and hardware architects to investigate performance bottlenecks and explore design optimizations before committing to silicon. Reconfigurable Architecture for Neuromorphic Computing (RANC) is one such tool that offers the ability to execute pre-trained Spiking Neural Network (SNN) models within a unified ecosystem t… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Accepted for publication in Neuro-Inspired Computational Elements (NICE) Workshop 2024

  17. arXiv:2403.16609  [pdf, other]

    cs.CL

    Conversational Grounding: Annotation and Analysis of Grounding Acts and Grounding Units

    Authors: Biswesh Mohapatra, Seemab Hassan, Laurent Romary, Justine Cassell

    Abstract: Successful conversations often rest on common understanding, where all parties are on the same page about the information being shared. This process, known as conversational grounding, is crucial for building trustworthy dialog systems that can accurately keep track of and recall the shared information. The proficiencies of an agent in grounding the conveyed information significantly contribute to… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Journal ref: LREC-COLING 2024

  18. arXiv:2403.14120  [pdf, other]

    cs.LG cs.AI eess.SP

    Advancing IIoT with Over-the-Air Federated Learning: The Role of Iterative Magnitude Pruning

    Authors: Fazal Muhammad Ali Khan, Hatem Abou-Zeid, Aryan Kaushik, Syed Ali Hassan

    Abstract: The industrial Internet of Things (IIoT) under Industry 4.0 heralds an era of interconnected smart devices where data-driven insights and machine learning (ML) fuse to revolutionize manufacturing. A noteworthy development in IIoT is the integration of federated learning (FL), which addresses data privacy and security among devices. FL enables edge sensors, also known as peripheral intelligence uni… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 6 pages, 6 figures

  19. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1110 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 8 August, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  20. arXiv:2402.06803  [pdf, ps, other]

    cs.DM cs.CC math.CO

    On graphs with well-distributed edge density

    Authors: Syed Mujtaba Hassan, Shahid Hussain

    Abstract: In this paper, we introduce a class of graphs which we call average hereditary graphs. Most graphs that occur in the usual graph theory applications belong to this class of graphs. Many popular types of graphs fall under this class, such as regular graphs, trees and other popular classes of graphs. We prove a new upper bound for the chromatic number of a graph in terms of its maximum average degre… ▽ More

    Submitted 21 March, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: 12 pages, 2 figures

    MSC Class: 05C15; 05C42; 05C07 (Primary) 05C75; 05C69; 05C25 (Secondary)

  21. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  22. arXiv:2312.10647  [pdf, other]

    cs.RO eess.SY

    Single-Stage Optimization of Open-loop Stable Limit Cycles with Smooth, Symbolic Derivatives

    Authors: Muhammad Saud Ul Hassan, Christian Hubicki

    Abstract: Open-loop stable limit cycles are foundational to legged robotics, providing inherent self-stabilization that minimizes the need for computationally intensive feedback-based gait correction. While previous methods have primarily targeted specific robotic models, this paper introduces a general framework for rapidly generating limit cycles across various dynamical systems, with the flexibility to i… ▽ More

    Submitted 17 September, 2024; v1 submitted 17 December, 2023; originally announced December 2023.

    Comments: 7 pages, 7 figures, submitted to ICRA-2025

  23. arXiv:2311.18147  [pdf, other]

    cs.CL

    DisCGen: A Framework for Discourse-Informed Counterspeech Generation

    Authors: Sabit Hassan, Malihe Alikhani

    Abstract: Counterspeech can be an effective method for battling hateful content on social media. Automated counterspeech generation can aid in this process. Generated counterspeech, however, can be viable only when grounded in the context of topic, audience and sensitivity as these factors influence both the efficacy and appropriateness. In this work, we propose a novel framework based on theories of discou… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: IJCNLP-AACL, 2023

  24. arXiv:2311.18007  [pdf, other]

    astro-ph.IM astro-ph.GA cs.LG

    Towards out-of-distribution generalization in large-scale astronomical surveys: robust networks learn similar representations

    Authors: Yash Gondhalekar, Sultan Hassan, Naomi Saphra, Sambatra Andrianomena

    Abstract: The generalization of machine learning (ML) models to out-of-distribution (OOD) examples remains a key challenge in extracting information from upcoming astronomical surveys. Interpretability approaches are a natural way to gain insights into the OOD generalization problem. We use Centered Kernel Alignment (CKA), a similarity measure metric of neural network representations, to examine the relatio… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Accepted to Machine Learning and the Physical Sciences Workshop, NeurIPS 2023

  25. arXiv:2311.12179  [pdf, other]

    cs.CL

    Leveraging Closed-Access Multilingual Embedding for Automatic Sentence Alignment in Low Resource Languages

    Authors: Idris Abdulmumin, Auwal Abubakar Khalid, Shamsuddeen Hassan Muhammad, Ibrahim Said Ahmad, Lukman Jibril Aliyu, Babangida Sani, Bala Mairiga Abduljalil, Sani Ahmad Hassan

    Abstract: The importance of qualitative parallel data in machine translation has long been determined but it has always been very difficult to obtain such in sufficient quantity for the majority of world languages, mainly because of the associated cost and also the lack of accessibility to these languages. Despite the potential for obtaining parallel datasets from online articles using automatic approaches,… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: To appear in the proceedings of ICCAIT 2023. 6 pages, 2 figures

  26. arXiv:2310.02104  [pdf, other]

    cs.SE

    An empirical study of ChatGPT-3.5 on question answering and code maintenance

    Authors: Md Mahir Asef Kabir, Sk Adnan Hassan, Xiaoyin Wang, Ying Wang, Hai Yu, Na Meng

    Abstract: Ever since the launch of ChatGPT in 2022, a rising concern is whether ChatGPT will replace programmers and kill jobs. Motivated by this widespread concern, we conducted an empirical study to systematically compare ChatGPT against programmers in question-answering and software-maintaining. We reused a dataset introduced by prior work, which includes 130 StackOverflow (SO) discussion threads referre… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  27. arXiv:2307.15469  [pdf, other]

    cs.NI eess.SP

    SpaceRIS: LEO Satellite Coverage Maximization in 6G Sub-THz Networks by MAPPO DRL and Whale Optimization

    Authors: Sheikh Salman Hassan, Yu Min Park, Yan Kyaw Tun, Walid Saad, Zhu Han, Choong Seon Hong

    Abstract: Satellite systems face a significant challenge in effectively utilizing limited communication resources to meet the demands of ground network traffic, characterized by asymmetrical spatial distribution and time-varying characteristics. Moreover, the coverage range and signal transmission distance of low Earth orbit (LEO) satellites are restricted by notable propagation attenuation, molecular absor… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

  28. arXiv:2307.14623  [pdf, other]

    cs.LG cs.AI cs.CE cs.DC

    BubbleML: A Multi-Physics Dataset and Benchmarks for Machine Learning

    Authors: Sheikh Md Shakeel Hassan, Arthur Feeney, Akash Dhruv, Jihoon Kim, Youngjoon Suh, Jaiyoung Ryu, Yoonjin Won, Aparna Chandramowlishwaran

    Abstract: In the field of phase change phenomena, the lack of accessible and diverse datasets suitable for machine learning (ML) training poses a significant challenge. Existing experimental datasets are often restricted, with limited availability and sparse ground truth data, impeding our understanding of these complex multiphysics phenomena. To bridge this gap, we present the BubbleML Dataset \footnote{\la… ▽ More

    Submitted 24 August, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

    Comments: Submitted to Neurips Datasets and Benchmarks Track 2023

  29. arXiv:2306.14476  [pdf, other]

    cs.LG cs.AI

    STEF-DHNet: Spatiotemporal External Factors Based Deep Hybrid Network for Enhanced Long-Term Taxi Demand Prediction

    Authors: Sheraz Hassan, Muhammad Tahir, Momin Uppal, Zubair Khalid, Ivan Gorban, Selim Turki

    Abstract: Accurately predicting the demand for ride-hailing services can result in significant benefits such as more effective surge pricing strategies, improved driver positioning, and enhanced customer service. By understanding the demand fluctuations, companies can anticipate and respond to consumer requirements more efficiently, leading to increased efficiency and revenue. However, forecasting demand in… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: 8 pages, 3 Figures

  30. arXiv:2306.04010  [pdf, other]

    cs.NE cs.ET

    A Novel Implementation Methodology for Error Correction Codes on a Neuromorphic Architecture

    Authors: Sahil Hassan, Parker Dattilo, Ali Akoglu

    Abstract: The Internet of Things infrastructure connects a massive number of edge devices with an increasing demand for intelligent sensing and inferencing capability. Such data-sensitive functions necessitate energy-efficient and programmable implementations of Error Correction Codes (ECC) and decoders. The algorithmic flow of ECCs with concurrent accumulation and comparison types of operations are innatel… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: To be published in IEEE Transactions On Computer-Aided Design Of Integrated Circuits And Systems (TCAD)

  31. arXiv:2305.19981  [pdf, other]

    cs.CL

    MedNgage: A Dataset for Understanding Engagement in Patient-Nurse Conversations

    Authors: Yan Wang, Heidi Ann Scharf Donovan, Sabit Hassan, Malihe Alikhani

    Abstract: Patients who effectively manage their symptoms often demonstrate higher levels of engagement in conversations and interventions with healthcare practitioners. This engagement is multifaceted, encompassing cognitive and socio-affective dimensions. Consequently, it is crucial for AI systems to understand the engagement in natural conversations between patients and practitioners to better contribute… ▽ More

    Submitted 20 June, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: ACL Findings 2023

  32. arXiv:2305.17013  [pdf, other]

    cs.CL

    D-CALM: A Dynamic Clustering-based Active Learning Approach for Mitigating Bias

    Authors: Sabit Hassan, Malihe Alikhani

    Abstract: Despite recent advancements, NLP models continue to be vulnerable to bias. This bias often originates from the uneven distribution of real-world data and can propagate through the annotation process. Escalated integration of these models in our lives calls for methods to mitigate bias without overbearing annotation costs. While active learning (AL) has shown promise in training models with a small… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: ACL FINDINGS 2023

  33. arXiv:2304.12396  [pdf, other]

    cs.DC

    CEDR-API: Productive, Performant Programming of Domain-Specific Embedded Systems

    Authors: Joshua Mack, Serhan Gener, Sahil Hassan, H. Umut Suluhan, Ali Akoglu

    Abstract: As the computing landscape evolves, system designers continue to explore design methodologies that leverage increased levels of heterogeneity to push performance within limited size, weight, power, and cost budgets. One such methodology is to build Domain-Specific System on Chips (DSSoCs) that promise increased productivity through narrowed scope of their target application domain. In previous wor… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

    Comments: 10 pages, 10 figures. Accepted for publication in the 2023 International Parallel and Distributed Processing Symposium (IPDPS) Heterogeneity in Computing Workshop (HCW)

  34. arXiv:2302.09618  [pdf, other]

    cs.CL

    Multilingual Content Moderation: A Case Study on Reddit

    Authors: Meng Ye, Karan Sikka, Katherine Atwell, Sabit Hassan, Ajay Divakaran, Malihe Alikhani

    Abstract: Content moderation is the process of flagging content based on pre-defined platform rules. There has been a growing need for AI moderators to safeguard users as well as protect the mental health of human moderators from traumatic content. While prior works have focused on identifying hateful/offensive language, they are not adequate for meeting the challenges of content moderation since 1) moderat… ▽ More

    Submitted 19 February, 2023; originally announced February 2023.

  35. arXiv:2212.07979  [pdf, other]

    cs.SE cs.CR cs.HC cs.PL

    Improving Developers' Understanding of Regex Denial of Service Tools through Anti-Patterns and Fix Strategies

    Authors: Sk Adnan Hassan, Zainab Aamir, Dongyoon Lee, James C. Davis, Francisco Servant

    Abstract: Regular expressions are used for diverse purposes, including input validation and firewalls. Unfortunately, they can also lead to a security vulnerability called ReDoS (Regular Expression Denial of Service), caused by a super-linear worst-case execution time during regex matching. Due to the severity and prevalence of ReDoS, past work proposed automatic tools to detect and fix regexes. Although th… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

    Comments: IEEE Security & Privacy 2023

  36. arXiv:2212.05757  [pdf, other]

    cs.NI

    Satellite-based ITS Data Offloading & Computation in 6G Networks: A Cooperative Multi-Agent Proximal Policy Optimization DRL with Attention Approach

    Authors: Sheikh Salman Hassan, Yu Min Park, Yan Kyaw Tun, Walid Saad, Zhu Han, Choong Seon Hong

    Abstract: The proliferation of intelligent transportation systems (ITS) has led to increasing demand for diverse network applications. However, conventional terrestrial access networks (TANs) are inadequate in accommodating various applications for remote ITS nodes, i.e., airplanes and ships. In contrast, satellite access networks (SANs) offer supplementary support for TANs, in terms of coverage flexibility… ▽ More

    Submitted 14 June, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

    Comments: 18 Pages, 20 Figures, Submitted to IEEE Transactions on Mobile Computing (TMC)-(Under Major Revision)

  37. Spam Review Detection Using Deep Learning

    Authors: G. M. Shahariar, Swapnil Biswas, Faiza Omar, Faisal Muhammad Shah, Samiha Binte Hassan

    Abstract: A robust and reliable system of detecting spam reviews is a crying need in today's world in order to purchase products without being cheated from online sites. In many online sites, there are options for posting reviews, and thus creating scopes for fake paid reviews or untruthful reviews. These concocted reviews can mislead the general public and put them in a perplexity whether to believe the rev… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

    Journal ref: 2019 IEEE 10th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON). IEEE, 2019

  38. arXiv:2209.12146  [pdf]

    eess.SY cs.LG stat.ML

    Machine Learning and Artificial Intelligence-Driven Multi-Scale Modeling for High Burnup Accident-Tolerant Fuels for Light Water-Based SMR Applications

    Authors: Md. Shamim Hassan, Abid Hossain Khan, Richa Verma, Dinesh Kumar, Kazuma Kobayashi, Shoaib Usman, Syed Alam

    Abstract: The concept of small modular reactor has changed the outlook for tackling future energy crises. This new reactor technology is very promising considering its lower investment requirements, modularity, design simplicity, and enhanced safety features. The application of artificial intelligence-driven multi-scale modeling (neutronics, thermal hydraulics, fuel performance, etc.) incorporating Digital… ▽ More

    Submitted 25 September, 2022; originally announced September 2022.

    Journal ref: Handbook of Smart Energy Systems, 2022

  39. arXiv:2209.08207  [pdf, other]

    cs.CL

    APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations

    Authors: Katherine Atwell, Sabit Hassan, Malihe Alikhani

    Abstract: Using style-transfer models to reduce offensiveness of social media comments can help foster a more inclusive environment. However, there are no sizable datasets that contain offensive texts and their inoffensive counterparts, and fine-tuning pretrained models with limited labeled data can lead to the loss of original meaning in the style-transferred text. To address this issue, we provide two maj… ▽ More

    Submitted 16 September, 2022; originally announced September 2022.

    Comments: To be published in Proceedings of COLING 2022, the 29th International Conference on Computational Linguistics

  40. Joint Trajectory and Resource Optimization of MEC-Assisted UAVs in Sub-THz Networks: A Resources-based Multi-Agent Proximal Policy Optimization DRL with Attention Mechanism

    Authors: Yu Min Park, Sheikh Salman Hassan, Yan Kyaw Tun, Zhu Han, Choong Seon Hong

    Abstract: THz band communication technology will be used in the 6G networks to enable high-speed and high-capacity data service demands. However, THz-communication losses arise owing to limitations, i.e., molecular absorption, rain attenuation, and coverage range. Furthermore, to maintain steady THz-communications and overcome coverage distances in rural and suburban regions, the required number of BSs is v… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: 13 pages, 12 figures

  41. arXiv:2208.11951  [pdf, other]

    cs.IT eess.SP

    Design of an Efficient CSI Feedback Mechanism in Massive MIMO Systems: A Machine Learning Approach using Empirical Data

    Authors: Muhammad Karam Shehzad, Luca Rose, Stefan Wesemann, Mohamad Assaad, Syed Ali Hassan

    Abstract: Massive multiple-input multiple-output (mMIMO) regime reaps the benefits of spatial diversity and multiplexing gains, subject to precise channel state information (CSI) acquisition. In the current communication architecture, the downlink CSI is estimated by the user equipment (UE) via dedicated pilots and then fed back to the gNodeB (gNB). The feedback information is compressed with the goal of re…

    Submitted 25 August, 2022; originally announced August 2022.

  42. arXiv:2208.11852  [pdf]

    cs.LG

    An Empirical Analysis of the Efficacy of Different Sampling Techniques for Imbalanced Classification

    Authors: Asif Newaz, Shahriar Hassan, Farhan Shahriyar Haq

    Abstract: Learning from imbalanced data is a challenging task. Standard classification algorithms tend to perform poorly when trained on imbalanced data. Some special strategies need to be adopted, either by modifying the data distribution or by redesigning the underlying classification algorithm to achieve desirable performance. The prevalence of imbalance in real-world datasets has led to the creation of…

    Submitted 24 August, 2022; originally announced August 2022.

    Comments: Submitted to "Information Sciences" (Elsevier)

  43. A Hardware-based HEFT Scheduler Implementation for Dynamic Workloads on Heterogeneous SoCs

    Authors: Alexander Fusco, Sahil Hassan, Joshua Mack, Ali Akoglu

    Abstract: Non-uniform performance and power consumption across the processing elements (PEs) of heterogeneous SoCs increase the computation complexity of the task scheduling problem compared to homogeneous architectures. Latency of a software-based scheduler with the increased heterogeneity level in terms of number and types of PEs creates the necessity of deploying a scheduler as an overlay processor in ha…

    Submitted 13 November, 2022; v1 submitted 22 July, 2022; originally announced July 2022.

    Comments: Presented at 2022 IFIP/IEEE 30th International Conference on Very Large Scale Integration (October 3-5)

    Journal ref: IFIP/IEEE 30th Int. Conf. on Very Large Scale Integr. (VLSI-SoC), 2022, pp. 1-6

  44. arXiv:2207.04021  [pdf, ps, other]

    cs.CL

    ASL-Homework-RGBD Dataset: An annotated dataset of 45 fluent and non-fluent signers performing American Sign Language homeworks

    Authors: Saad Hassan, Matthew Seita, Larwan Berke, Yingli Tian, Elaine Gale, Sooyeon Lee, Matt Huenerfauth

    Abstract: We are releasing a dataset containing videos of both fluent and non-fluent signers using American Sign Language (ASL), which were collected using a Kinect v2 sensor. This dataset was collected as a part of a project to develop and evaluate computer vision algorithms to support new technologies for automatic detection of ASL fluency attributes. A total of 45 fluent and non-fluent participants were…

    Submitted 8 July, 2022; originally announced July 2022.

  45. arXiv:2207.00426  [pdf, ps, other]

    stat.CO cs.DC

    Parallel square-root statistical linear regression for inference in nonlinear state space models

    Authors: Fatemeh Yaghoobi, Adrien Corenflos, Sakira Hassan, Simo Särkkä

    Abstract: In this article, we introduce parallel-in-time methods for state and parameter estimation in general nonlinear non-Gaussian state-space models using the statistical linear regression and the iterated statistical posterior linearization paradigms. We also reformulate the proposed methods in a square-root form, resulting in improved numerical stability while preserving the parallelization capabiliti…

    Submitted 5 April, 2023; v1 submitted 29 June, 2022; originally announced July 2022.

  46. Using BERT Embeddings to Model Word Importance in Conversational Transcripts for Deaf and Hard of Hearing Users

    Authors: Akhter Al Amin, Saad Hassan, Cecilia O. Alm, Matt Huenerfauth

    Abstract: Deaf and hard of hearing individuals regularly rely on captioning while watching live TV. Live TV captioning is evaluated by regulatory agencies using various caption evaluation metrics. However, caption evaluation metrics are often not informed by preferences of DHH users or how meaningful the captions are. There is a need to construct caption evaluation metrics that take the relative importance…

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: 5 pages, 3 tables, 1 figure

  47. arXiv:2205.09084  [pdf, other]

    cs.NI eess.SP

    Industry 5.0 is Coming: A Survey on Intelligent NextG Wireless Networks as Technological Enablers

    Authors: Shah Zeb, Aamir Mahmood, Sunder Ali Khowaja, Kapal Dev, Syed Ali Hassan, Nawab Muhammad Faseeh Qureshi, Mikael Gidlund, Paolo Bellavista

    Abstract: Industry 5.0 vision, a step toward the next industrial revolution and enhancement to Industry 4.0, envisioned the new goals of resilient, sustainable, and human-centric approaches in diverse emerging applications, e.g., factories-of-the-future, digital society. The vision seeks to leverage human intelligence and creativity in nexus with intelligent, efficient, and reliable cognitive collaborating…

    Submitted 18 May, 2022; originally announced May 2022.

  48. arXiv:2204.12526  [pdf, other]

    q-bio.QM cs.LG stat.ML

    Identification of feasible pathway information for c-di-GMP binding proteins in cellulose production

    Authors: Syeda Sakira Hassan, Rahul Mangayil, Tommi Aho, Olli Yli-Harja, Matti Karp

    Abstract: In this paper, we utilize a machine learning approach to identify the significant pathways for c-di-GMP signaling proteins. The dataset involves gene counts from 12 pathways and 5 essential c-di-GMP binding domains for 1024 bacterial genomes. Two novel approaches, Least absolute shrinkage and selection operator (Lasso) and Random forests, have been applied for analyzing and modeling the dataset. B…

    Submitted 26 April, 2022; originally announced April 2022.

    Journal ref: EMBEC & NBC 2017. EMBEC NBC 2017 2017. IFMBE Proceedings, vol 65. Springer, Singapore

  49. CEDR -- A Compiler-integrated, Extensible DSSoC Runtime

    Authors: Joshua Mack, Sahil Hassan, Nirmal Kumbhare, Miguel Castro-Gonzalez, Ali Akoglu

    Abstract: In this work, we present CEDR, a Compiler-integrated, Extensible Domain Specific System on Chip Runtime ecosystem to facilitate research towards addressing the challenges of architecture, system software and application development with distinct plug-and-play integration points in a unified compile time and run time workflow. We demonstrate the utility of CEDR on the Xilinx Zynq MPSoC-ZCU102 for e…

    Submitted 15 April, 2022; originally announced April 2022.

    Comments: 35 pages single column, 16 figures, 7 tables. Accepted for publication in the ACM Transactions on Embedded and Computing Systems

  50. arXiv:2204.03794  [pdf]

    cs.SE

    On the Importance of Performing App Analysis Within Peer Groups

    Authors: Safwat Hassan, Heng Li, Ahmed E. Hassan

    Abstract: The competing nature of the app market motivates us to shift our focus on apps that provide similar functionalities and directly compete with each other (i.e., peer apps). In this work, we study the ratings and the review text of 100 Android apps across 10 peer app groups. We highlight the importance of performing peer-app analysis by showing that it can provide a unique perspective over performin…

    Submitted 7 April, 2022; originally announced April 2022.