Current Research Areas

AI / Deep Learning Hardware Acceleration

Design of hardware accelerators for efficient CNN/DNN inference. Research directions include:

  • Exploiting sparsity and input repetition in convolutional networks to reduce multiplications (SIRENA accelerator)
  • Quantization-aware inference (nibble-based arithmetic, INT4/INT8)
  • FPGA and GPU-based inference optimization
  • GEMM-like convolution kernels on heterogeneous hardware (Xilinx Versal, multi-FPGA)
  • Distributed data-parallel training for HPC systems
FPGA CNN Inference Quantization Sparsity Distributed Training
Fault Tolerance & Safety-Critical Computing

Ensuring reliability and functional safety in high-performance and AI systems:

  • Bit-level redundancy exploitation in neural networks to mitigate inference faults
  • Register file randomization for robust redundant execution
  • Self-monitored dependable platforms for safety-critical applications (SELENE project)
  • Deep learning acceleration frameworks for safety-related applications
  • Predictability enforcement in many-core systems (DCFNoC)
Fault Tolerance Safety-Critical Redundant Execution AI Reliability
Heterogeneous HPC Architectures

Next-generation computing platforms combining diverse processing elements:

  • Open-source FPGA platform for shared-memory heterogeneous many-core exploration (MANGO project)
  • HPC/AI convergence: workflows combining simulation, data analytics, and ML
  • Distributed training on deeply heterogeneous systems
  • Neural network model parallelism on multi-FPGA platforms
  • Neuromorphic sensing-processing 3D-integrated chips (NimbleAI project)
HPC FPGA Many-core HPC/AI Convergence Neuromorphic
Networks-on-Chip (NoC)

Foundational expertise in on-chip interconnection networks:

  • Routing algorithms (LBDR, topology-agnostic deterministic routing)
  • Congestion management (ICARO, end-point filters)
  • TDM-based predictable NoCs: DCFNoC and HP-DCFNoC for real-time guarantees
  • Protocol-driven NoC architectures (PROSA)
  • NoC virtualization and reconfiguration (OSR, UPR)
  • Fault-tolerant routing and 3D NoC design
NoC Routing Congestion Control TDM Virtualization
RISC-V Architecture & Simulation

Open-source simulator for RISC-V architecture, used for teaching and research:

  • Behavioral single-cycle simulator (C-based)
  • 5-stage in-order pipeline implemented in Verilog (Verilator-based)
  • Support for RISC-V extension M (integer multiplication/division) and B (bit manipulation)
  • Running real OS: xv6 operating system
  • Device emulation: PLIC, CLINT, UART
RISC-V Verilog Verilator Computer Architecture
High-Performance Off-Chip Interconnects

InfiniBand and fat-tree based interconnects for HPC clusters:

  • Topology-agnostic routing algorithms (survey and evaluation)
  • Deadlock-free network reconfiguration protocols
  • Hybrid congestion control for HPC interconnection networks
  • Head-of-line blocking elimination in fat-trees (OBQA scheme)
  • Inter-subnet routing scalability for InfiniBand
InfiniBand Fat-tree Routing Congestion Management

Selected Key Publications

A selection of representative papers across research areas. See the full publication list for the complete record.

AI / DL Acceleration
SIRENA: SparsIty-REpetition aware Nibble-based hardware Accelerator for CNNs
L. Medina, J. Flich
J. Syst. Archit. · Journal · 2025
Toward Matrix Multiplication for Deep Learning Inference on the Xilinx Versal
J. Lei, J. Flich, E.S. Quintana-Ortí
PDP 2023 · Conference
Efficient Inference of Image-Based Neural Network Models in Reconfigurable Systems
J. Flich, L. Medina, I. Catalán et al.
ICIP 2022 · Conference
Fault Tolerance
Exploiting neural networks bit-level redundancy to mitigate the impact of faults at inference
I. Catalán, J. Flich, C. Hernández
J. Supercomput. · Journal · 2025
Enforcing Predictability of Many-Cores With DCFNoC
T. Picornell, J. Flich, C. Hernández, J. Duato
IEEE Trans. Computers · Journal · 2021
Improving the Robustness of Redundant Execution with Register File Randomization
I. Tuzov, P. Andreu, L. Medina, J. Flich et al.
ICCAD 2021 · Conference
Networks-on-Chip
A Survey and Evaluation of Topology-Agnostic Deterministic Routing Algorithms
J. Flich, T. Skeie, A. Mejia, O. Lysne et al.
IEEE TPDS · Journal · 2012
PROSA: Protocol-Driven Network on Chip Architecture
M. Gorgues Alonso, J. Flich
IEEE TPDS · Journal · 2018
Cost-Efficient On-Chip Routing Implementations for CMP and MPSoC Systems
S. Rodrigo, J. Flich, A. Roca et al.
IEEE TCAD · Journal · 2011
HPC Architectures
Exploring manycore architectures for next-generation HPC systems through the MANGO approach
J. Flich, G. Agosta, D. Atienza et al.
Microprocess. Microsystems · Journal · 2018
Enabling dynamic and intelligent workflows for HPC, data analytics, and AI convergence
J. Ejarque, R.M. Badia, J. Flich et al.
Future Gener. Comput. Syst. · Journal · 2022
From a FPGA Prototyping Platform to a Computing Platform: The MANGO Experience
J. Flich, R. Tornero, D. Rodriguez et al.
DATE 2021 · Conference

Research Collaborators

I collaborate with researchers from leading institutions including:

Politecnico di Milano (Italy)
EPFL — École Polytechnique Fédérale de Lausanne (Switzerland)
Univ. of Naples Federico II (Italy)
BSC — Barcelona Supercomputing Center
Simula Research Lab (Norway)
Univ. of Bologna (Italy)
Politecnico di Torino (Italy)
Univ. of Castilla–La Mancha
Univ. de Zaragoza