- research-article, July 2024
InSS: An Intelligent Scheduling Orchestrator for Multi-GPU Inference With Spatio-Temporal Sharing
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 35, Issue 10, Pages 1735–1748, https://doi.org/10.1109/TPDS.2024.3430063
As the applications of AI proliferate, it is critical to increase the throughput of online DNN inference services. Multi-process service (MPS) improves the utilization rate of GPU resources by spatial-sharing, but it also brings unique challenges. First, ...
- research-article, November 2023
Hopscotch: A Hardware-Software Co-Design for Efficient Cache Resizing on Multi-Core SoCs
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 35, Issue 1, Pages 89–104, https://doi.org/10.1109/TPDS.2023.3332711
Following the trend of increasing autonomy in real-time systems, multi-core System-on-Chips (SoCs) have enabled devices to better handle the large streams of data and intensive computation required by such autonomous systems. In modern multi-core SoCs, ...
- research-article, October 2023
AutoRS: Environment-Dependent Real-Time Scheduling for End-to-End Autonomous Driving
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 34, Issue 12, Pages 3238–3252, https://doi.org/10.1109/TPDS.2023.3323975
The rapid development of autonomous driving poses new research challenges for on-vehicle computing system. The execution time of autonomous driving tasks heavily depends on the driving environment. As the scene becomes complex, task execution time ...
- research-article, December 2022
Highly Accurate Clock Synchronization With Drift Correction for the Controller Area Network
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 33, Issue 12, Pages 4071–4082, https://doi.org/10.1109/TPDS.2022.3179316
Modern vehicles, that have to be considered as safety-critical cyber-physical systems, require highly accurate clock synchronization (CS) among their distributed computing devices. Since Controller Area Network (CAN) is the predominant in-...
- research-article, December 2022
Run-Time Remapping Algorithm of Dataflow Actors on NoC-Based Heterogeneous MPSoCs
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 33, Issue 12, Pages 3959–3976, https://doi.org/10.1109/TPDS.2022.3177957
Multiprocessor system-on-chip (MPSoC) platforms have been emerging as the main solution to cope with processor frequency ceiling and power density issues while still improving performances. Then, network-on-chip (NoC) has been adopted to provide the ...
- research-article, December 2022
DAG Scheduling and Analysis on Multi-Core Systems by Modelling Parallelism and Dependency
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 33, Issue 12, Pages 4019–4038, https://doi.org/10.1109/TPDS.2022.3177046
With ever more complex functionalities being implemented in emerging real-time applications, multi-core systems are demanded for high performance, with directed acyclic graphs (DAG) being used to model functional dependencies. For a single DAG task, our ...
- research-article, April 2020
HRHS: A High-Performance Real-Time Hardware Scheduler
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 31, Issue 4, Pages 897–908, https://doi.org/10.1109/TPDS.2019.2952136
This article represents an on-line time-predictable distributed hardware scheduler solution, suitable for many-core systems. We have partitioned the Main scheduler into uniform Partial schedulers to achieve a significant gain in term of performance and ...
- research-article, January 2020
Scheduling Parallel Real-Time Tasks on the Minimum Number of Processors
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 31, Issue 1, Pages 171–186, https://doi.org/10.1109/TPDS.2019.2929048
Recently, several parallel frameworks have emerged to utilize the increasing computational capacity of multiprocessors. Parallel tasks are distinguished from traditional sequential tasks in that the subtasks contained in a single parallel task can ...
- research-article, January 2019
Peak Power Management to Meet Thermal Design Power in Fault-Tolerant Embedded Systems
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 30, Issue 1, Pages 161–173, https://doi.org/10.1109/TPDS.2018.2858816
Multicore platforms provide a great opportunity for implementation of fault-tolerance techniques to achieve high reliability in real-time embedded systems. Passive redundancy is well-suited for multicore platforms and a well-established technique to ...
- research-article, September 2018
Unleashing Fine-Grained Parallelism on Embedded Many-Core Accelerators with Lightweight OpenMP Tasking
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 29, Issue 9, Pages 2150–2163, https://doi.org/10.1109/TPDS.2018.2814602
In recent years, programmable many-core accelerators (PMCAs) have been introduced in embedded systems to satisfy stringent performance/Watt requirements. This has increased the urge for programming models capable of effectively leveraging hundreds to ...
- research-article, December 2017
Energy-Efficient Scheduling Algorithms for Real-Time Parallel Applications on Heterogeneous Distributed Embedded Systems
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 28, Issue 12, Pages 3426–3442, https://doi.org/10.1109/TPDS.2017.2730876
Energy consumption minimization is one of the primary design requirements for heterogeneous distributed systems. State-of-the-art algorithms are used to study the problem of minimizing the energy consumption of a real-time parallel application with ...
- research-article, July 2017
Response-Time Analysis in Hierarchically-Scheduled Time-Partitioned Distributed Systems
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 28, Issue 7, Pages 2017–2030, https://doi.org/10.1109/TPDS.2016.2642960
This paper develops an offset-based response-time analysis technique for analyzing complex distributed real-time systems where processing and communication resources use the time-partitioning strategy to isolate the operation of separate software ...
- research-article, June 2017
Real-Time GPU Resource Management with Loadable Kernel Modules
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 28, Issue 6, Pages 1715–1727, https://doi.org/10.1109/TPDS.2016.2630697
Graphics processing unit (GPU) programming environments have matured for general-purpose computing on GPUs. Significant challenges for GPUs include system software support for bounded response times and guaranteed throughput. In recent years, GPU ...
- research-article, December 2016
Fault-Tolerant Scheduling for Real-Time Scientific Workflows with Elastic Resource Provisioning in Virtualized Clouds
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 27, Issue 12, Pages 3501–3517, https://doi.org/10.1109/TPDS.2016.2543731
Clouds are becoming an important platform for scientific workflow applications. However, with many nodes being deployed in clouds, managing reliability of resources becomes a critical issue, especially for the real-time scientific workflow execution ...
- research-article, November 2016
Enabling Data-Centric Distribution Technology for Partitioned Embedded Systems
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 27, Issue 11, Pages 3186–3198, https://doi.org/10.1109/TPDS.2016.2531695
Modern complex embedded systems are evolving into mixed-criticality systems in order to satisfy a wide set of non-functional requirements such as security, cost, weight, timing or power consumption. Partitioning is an enabling technology for this ...
- research-article, October 2015
Leveraging Parallel Communications for Minimizing Energy Consumption on Smartphones
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 26, Issue 10, Pages 2778–2790, https://doi.org/10.1109/TPDS.2014.2360693
Recent energy measurements on smartphones have shown that parallel communications (e.g., data transfer and voice call) require less energy than their stand-alone execution. Guided by these results, we investigate the possibility of scheduling ...
- research-article, October 2015
Deadline Assignment in EDF Schedulers for Real-Time Distributed Systems
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 26, Issue 10, Pages 2671–2684, https://doi.org/10.1109/TPDS.2014.2359449
Real-time distributed systems contain end-to-end flows, which are distributed actions composed of sequences of tasks activated through messages. Such flows usually have an end-to-end deadline but the internal tasks and messages do not have specific timing ...
- research-article, October 2015
A Lease Based Hybrid Design Pattern for Proper-Temporal-Embedding of Wireless CPS Interlocking
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 26, Issue 10, Pages 2630–2642, https://doi.org/10.1109/TPDS.2014.2358224
Cyber-Physical Systems (CPS) integrate discrete-time computing and continuous-time physical-world entities, which are often wirelessly interlinked. The use of wireless safety-critical CPS requires safety guarantees despite communication faults. This paper ...
- research-article, August 2015
Design of Mixed Synchronous/Asynchronous Systems with Multiple Clocks
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 26, Issue 8, Pages 2220–2232, https://doi.org/10.1109/TPDS.2014.2346171
Today's distributed systems are commonly equipped with both synchronous and asynchronous components controlled with multiple clocks. The key challenges in designing such systems are (1) how to model multi-clocked local synchronous component, local ...
- research-article, March 2015
Contention-Aware Energy Management Scheme for NoC-Based Multicore Real-Time Systems
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 26, Issue 3, Pages 691–701, https://doi.org/10.1109/TPDS.2014.2307866
Network-on-Chip (NoC) has emerged as interconnect paradigm in state-of-the-art multi/many core architectures. Voltage and frequency island (VFI) was recently adopted as an effective energy management technique for large scale multicore chip designs. ...