\documentclass[10pt]{article} \usepackage{fontspec} \usepackage{csquotes} \usepackage{polyglossia} \setmainlanguage{english} \usepackage[backend=biber,style=ieee,maxnames=8]{biblatex} \renewcommand*{\bibfont}{\small} \usepackage{titling} %\renewcommand*{\bibfont}{\footnotesize} \addbibresource{bib/csa.bib} \title{Suren A. Chilingaryan} \preauthor{} \postauthor{} \author{} \predate{} \postdate{} \date{} %\posttitle{\par\end{center}} \setlength{\droptitle}{-4em} \begin{document} \maketitle I am specializing in the domain of high-performance and heterogeneous computing, computer architectures, and parallel algorithms. On a technical side, I have experience in performance analysis and optimization, parallel programming, low-latency communication, and cloud platforms. Working at Institute of Data Processing and Electronics (IPE) at KIT, I apply these technologies to build software instrumentation for distributed data acquisition and control systems. I built and are currently maintaining a highly-available cloud platform for KATRIN (KArlsruhe TRItium Neutrino) data acquisition and slow-control systems~\cite{katrin2015detector,katrin2018first}. Parts of the system are adapted to support ASEC (Aragats Space Environmental Center) and SEVAN (Space Environmental Viewing and Analysis Network) particle detector networks in Armenia to study thunderstorm phenomena~\cite{csa2009sevan, chili2010thunderstorm}. I led a software work package of the UFO (Ultra Fast tOmography) project aimed to build a novel instrumentation for high-speed synchrotron imaging with online reconstruction and image-based feedback loop~\cite{kopmann2017ufo}. We developed a control system integrating the beam line devices with a GPU-based image-processing cluster and steering the data from the cameras until the storage~\cite{stevanovic2015concert}. IPE is actively designing novel electronics for multiple collaborations~\cite{caselle2013camera,caselle2014kapture}. Our group is looking for a hybrid solutions coupling the high-speed electronics with fast, but flexible software running on GPUs and other parallel accelerators~\cite{vogelgesang2016dgma}. For instance, recently we have performed a case-study aimed to evaluate the possibility of building the next generation of CMS track trigger using GPUs with round-trip latency below 6 us~\cite{mohr2017cms}. It is a challenging task to design an efficient computing system. Well designed data flow, a hierarchy of intelligent caches, and efficient parallel algorithms can drastically reduce required investments. Throughout all projects, we take a holistic approach to understand project requirements, identify bottlenecks, and optimize performance-critical components. To get an in-depth understanding of available parallel architectures, I have systematically applied micro-benchmarking techniques. It allowed to find multiple undocumented properties of the available hardware and to develop a range of techniques to balance the load between different computational and memory units achieving higher hardware utilization~\cite{csa2018sbac}. We have developed a pipelined image processing framework and contributed parallel algorithms addressing various hardware platforms including IBM Power, Intel Xeon Phi, and multiple GPU architectures~\cite{csa2011pyhst,vogelgesang2012ufo,ashkarin2015,rshkarin2015,cavadini2018upiv}. To enable interactive remote visualization of large tomographic volumes, we develop a web-based visualization framework combining client- and server-side rendering techniques~\cite{ntj2017wave}. Because of the client-side component, high interactivity is achieved with only small investments in the data center hardware. On the other hand, the server-side component allows to improve quality on demand and makes visualization possible also for slow hand-held devices. \printbibliography \end{document}