-
Design of a Modular GaN-based Three-Phase Three-Level ANPC Inverter
Authors:
Angelo Di Cataldo,
Hamed Eivazi,
Giuseppe Aiello,
Dario Patti,
Giacomo Scelba,
Mario Cacciato,
Francesco Gennaro
Abstract:
This paper presents the design of an 800 V 11 kVA three-level three-phase active neutral point clamped inverter, utilizing 650 V gallium nitride enhancement-mode high-electron-mobility transistors, to evaluate its feasibility in electric traction systems. The modular approach of the presented power converter design is detailed discussed and the different printed circuit boards composing the power…
▽ More
This paper presents the design of an 800 V 11 kVA three-level three-phase active neutral point clamped inverter, utilizing 650 V gallium nitride enhancement-mode high-electron-mobility transistors, to evaluate its feasibility in electric traction systems. The modular approach of the presented power converter design is detailed discussed and the different printed circuit boards composing the power converter are presented, together with critical design issues. The paper includes the gate driver design, as well as the thermal analysis and parasitics extraction using ANSYS Q3D. Extractor.
△ Less
Submitted 11 April, 2024;
originally announced June 2024.
-
Attention-Based Deep Reinforcement Learning for Qubit Allocation in Modular Quantum Architectures
Authors:
Enrico Russo,
Maurizio Palesi,
Davide Patti,
Giuseppe Ascia,
Vincenzo Catania
Abstract:
Modular, distributed and multi-core architectures are currently considered a promising approach for scalability of quantum computing systems. The integration of multiple Quantum Processing Units necessitates classical and quantum-coherent communication, introducing challenges related to noise and quantum decoherence in quantum state transfers between cores. Optimizing communication becomes imperat…
▽ More
Modular, distributed and multi-core architectures are currently considered a promising approach for scalability of quantum computing systems. The integration of multiple Quantum Processing Units necessitates classical and quantum-coherent communication, introducing challenges related to noise and quantum decoherence in quantum state transfers between cores. Optimizing communication becomes imperative, and the compilation and mapping of quantum circuits onto physical qubits must minimize state transfers while adhering to architectural constraints. The compilation process, inherently an NP-hard problem, demands extensive search times even with a small number of qubits to be solved to optimality. To address this challenge efficiently, we advocate for the utilization of heuristic mappers that can rapidly generate solutions. In this work, we propose a novel approach employing Deep Reinforcement Learning (DRL) methods to learn these heuristics for a specific multi-core architecture. Our DRL agent incorporates a Transformer encoder and Graph Neural Networks. It encodes quantum circuits using self-attention mechanisms and produce outputs through an attention-based pointer mechanism that directly signifies the probability of matching logical qubits with physical cores. This enables the selection of optimal cores for logical qubits efficiently. Experimental evaluations show that the proposed method can outperform baseline approaches in terms of reducing inter-core communications and minimizing online time-to-solution. This research contributes to the advancement of scalable quantum computing systems by introducing a novel learning-based heuristic approach for efficient quantum circuit compilation and mapping.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Assessing the Role of Communication in Scalable Multi-Core Quantum Architectures
Authors:
Maurizio Palesi,
Enrico Russo,
Davide Patti,
Giuseppe Ascia,
Vincenzo Catania
Abstract:
Multi-core quantum architectures offer a solution to the scalability limitations of traditional monolithic designs. However, dividing the system into multiple chips introduces a critical bottleneck: communication between cores. This paper introduces qcomm, a simulation tool designed to assess the impact of communication on the performance of scalable multi-core quantum architectures. Qcomm allows…
▽ More
Multi-core quantum architectures offer a solution to the scalability limitations of traditional monolithic designs. However, dividing the system into multiple chips introduces a critical bottleneck: communication between cores. This paper introduces qcomm, a simulation tool designed to assess the impact of communication on the performance of scalable multi-core quantum architectures. Qcomm allows users to adjust various architectural and physical parameters of the system, and outputs various communication metrics. We use qcomm to perform a preliminary study on how these parameters affect communication performance in a multi-core quantum system.
△ Less
Submitted 25 May, 2024;
originally announced May 2024.
-
Deep Reinforcement Learning based Online Scheduling Policy for Deep Neural Network Multi-Tenant Multi-Accelerator Systems
Authors:
Francesco G. Blanco,
Enrico Russo,
Maurizio Palesi,
Davide Patti,
Giuseppe Ascia,
Vincenzo Catania
Abstract:
Currently, there is a growing trend of outsourcing the execution of DNNs to cloud services. For service providers, managing multi-tenancy and ensuring high-quality service delivery, particularly in meeting stringent execution time constraints, assumes paramount importance, all while endeavoring to maintain cost-effectiveness. In this context, the utilization of heterogeneous multi-accelerator syst…
▽ More
Currently, there is a growing trend of outsourcing the execution of DNNs to cloud services. For service providers, managing multi-tenancy and ensuring high-quality service delivery, particularly in meeting stringent execution time constraints, assumes paramount importance, all while endeavoring to maintain cost-effectiveness. In this context, the utilization of heterogeneous multi-accelerator systems becomes increasingly relevant. This paper presents RELMAS, a low-overhead deep reinforcement learning algorithm designed for the online scheduling of DNNs in multi-tenant environments, taking into account the dataflow heterogeneity of accelerators and memory bandwidths contentions. By doing so, service providers can employ the most efficient scheduling policy for user requests, optimizing Service-Level-Agreement (SLA) satisfaction rates and enhancing hardware utilization. The application of RELMAS to a heterogeneous multi-accelerator system composed of various instances of Simba and Eyeriss sub-accelerators resulted in up to a 173% improvement in SLA satisfaction rate compared to state-of-the-art scheduling techniques across different workload scenarios, with less than a 1.5% energy overhead.
△ Less
Submitted 13 April, 2024;
originally announced April 2024.
-
Towards Fair and Firm Real-Time Scheduling in DNN Multi-Tenant Multi-Accelerator Systems via Reinforcement Learning
Authors:
Enrico Russo,
Francesco Giulio Blanco,
Maurizio Palesi,
Giuseppe Ascia,
Davide Patti,
Vincenzo Catania
Abstract:
This paper addresses the critical challenge of managing Quality of Service (QoS) in cloud services, focusing on the nuances of individual tenant expectations and varying Service Level Indicators (SLIs). It introduces a novel approach utilizing Deep Reinforcement Learning for tenant-specific QoS management in multi-tenant, multi-accelerator cloud environments. The chosen SLI, deadline hit rate, all…
▽ More
This paper addresses the critical challenge of managing Quality of Service (QoS) in cloud services, focusing on the nuances of individual tenant expectations and varying Service Level Indicators (SLIs). It introduces a novel approach utilizing Deep Reinforcement Learning for tenant-specific QoS management in multi-tenant, multi-accelerator cloud environments. The chosen SLI, deadline hit rate, allows clients to tailor QoS for each service request. A novel online scheduling algorithm for Deep Neural Networks in multi-accelerator systems is proposed, with a focus on guaranteeing tenant-wise, model-specific QoS levels while considering real-time constraints.
△ Less
Submitted 9 February, 2024;
originally announced March 2024.
-
A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms
Authors:
Cristina Silvano,
Daniele Ielmini,
Fabrizio Ferrandi,
Leandro Fiorin,
Serena Curzel,
Luca Benini,
Francesco Conti,
Angelo Garofalo,
Cristian Zambelli,
Enrico Calore,
Sebastiano Fabio Schifano,
Maurizio Palesi,
Giuseppe Ascia,
Davide Patti,
Stefania Perri,
Nicola Petra,
Davide De Caro,
Luciano Lavagno,
Teodoro Urso,
Valeria Cardellini,
Gian Carlo Cardarilli,
Robert Birke
Abstract:
Recent trends in deep learning (DL) imposed hardware accelerators as the most viable solution for several classes of high-performance computing (HPC) applications such as image classification, computer vision, and speech recognition. This survey summarizes and classifies the most recent advances in designing DL accelerators suitable to reach the performance requirements of HPC applications. In par…
▽ More
Recent trends in deep learning (DL) imposed hardware accelerators as the most viable solution for several classes of high-performance computing (HPC) applications such as image classification, computer vision, and speech recognition. This survey summarizes and classifies the most recent advances in designing DL accelerators suitable to reach the performance requirements of HPC applications. In particular, it highlights the most advanced approaches to support deep learning accelerations including not only GPU and TPU-based accelerators but also design-specific hardware accelerators such as FPGA-based and ASIC-based accelerators, Neural Processing Units, open hardware RISC-V-based accelerators and co-processors. The survey also describes accelerators based on emerging memory technologies and computing paradigms, such as 3D-stacked Processor-In-Memory, non-volatile memories (mainly, Resistive RAM and Phase Change Memories) to implement in-memory computing, Neuromorphic Processing Units, and accelerators based on Multi-Chip Modules. The survey classifies the most influential architectures and technologies proposed in the last years, with the purpose of offering the reader a comprehensive perspective in the rapidly evolving field of deep learning. Finally, it provides some insights into future challenges in DL accelerators such as quantum accelerators and photonics.
△ Less
Submitted 27 June, 2023;
originally announced June 2023.
-
Power Loss Modelling of GaN HEMT based 3L ANPC Three Phase Inverter for different PWM Techniques
Authors:
M. Cacciato,
G. Aiello,
F. Gennaro,
S. Mita,
D. Patti,
G. Scelba,
A. Sujeeth
Abstract:
The paper presents a straightforward modelling approach to compute the power loss distribution in GaN HEMT based three phase and three level (3L) active neutral point clamped (ANPC) inverters, for different pulse width modulated techniques. Conduction and switching losses averaged over each PWM switching period are analytically computed by starting from the operating conditions of the AC load and…
▽ More
The paper presents a straightforward modelling approach to compute the power loss distribution in GaN HEMT based three phase and three level (3L) active neutral point clamped (ANPC) inverters, for different pulse width modulated techniques. Conduction and switching losses averaged over each PWM switching period are analytically computed by starting from the operating conditions of the AC load and data of GaN power devices. The accuracy of the proposed analytical approach is evaluated through a circuit based power electronics simulation tool, applied to different carrier-based PWM strategies.
△ Less
Submitted 10 December, 2022;
originally announced December 2022.
-
Escaping the trap of 'blocking': a kinetic model linking economic development and political competition
Authors:
Marina Dolfin,
Damián Knopoff,
Leone Leonida,
Dario Maimone Ansaldo Patti
Abstract:
In this paper we present a kinetic model with stochastic game-type interactions, analyzing the relationship between the level of political competition in a society and the degree of economic liberalization. The above issue regards the complex interactions between economy and institutional policies intended to introduce technological innovations in a society, where technological innovations are int…
▽ More
In this paper we present a kinetic model with stochastic game-type interactions, analyzing the relationship between the level of political competition in a society and the degree of economic liberalization. The above issue regards the complex interactions between economy and institutional policies intended to introduce technological innovations in a society, where technological innovations are intended in a broad sense comprehending reforms critical to production. A special focus is placed on the political replacement effect described in a macroscopic model by Acemoglu and Robinson (AR-model, henceforth), which can determine the phenomenon of innovation 'blocking', possibly leading to economic backwardness. One of the goals of our modelization is to obtain a mesoscopic dynamical model whose macroscopic outputs are qualitatively comparable with stylized facts of the AR-model. A set of numerical solutions is presented showing the non monotonous relationship between economic liberization and political competition, which can be considered as an emergent phenomenon of the complex socio-economic interaction dynamic.
△ Less
Submitted 17 December, 2015;
originally announced February 2016.