Published inParallel & Distributed Computing For Data EnthusiastsScalability of IO-Intensive ApplicationsModern computing applications can be divided into broader categories: IO-intensive and Compute-Intensive. As the name suggests…Oct 15Oct 15
Published inParallel & Distributed Computing For Data EnthusiastsVisual Guide to Distribution Patterns for Arrays in MPI, NCCLWhen multiple processes are involved in a parallel computation, they must communicate periodically to synchronize the data. Many libraries…Sep 16Sep 16
Published inParallel & Distributed Computing For Data EnthusiastsIntroduction to UCX Network ProgrammingNetwork programming is always a hassle; on the bright side, as developers, we rarely get to write network programs ourselves. In most…Sep 20Sep 20
Published inParallel & Distributed Computing For Data EnthusiastsParquet, Orc, Avro, CSV and JSONWe encounter data in many different formats. Some common examples are CSV, JSON, XML, Text, and Binary types. Every such format has a…Sep 5Sep 5
Published inParallel & Distributed Computing For Data EnthusiastsTable Abstractions for Data ProcessingTable abstractions are a great way to represent heterogeneous and structured data in memory. In data analytics, a table is a structured…Aug 30Aug 30
Published inParallel & Distributed Computing For Data EnthusiastsArrays & Tensors for Data ProcessingVectors and matrices are two foundational structures in computations. A vector is an ordered collection of elements. A matrix is a…Aug 25Aug 25
Published inParallel & Distributed Computing For Data EnthusiastsIn-Memory Data Structures for Data OperationsEvery program deals with data structures. Whether it is a distributed program running on thousands of computers or a serial program running…Aug 15Aug 15
Published inParallel & Distributed Computing For Data EnthusiastsHigh-performance Computing for Modern Data Applications — An IntroductionData-driven computing coupled with artificial intelligence (AI) is the crown jewel of modern computing. From the 1980s to around 2010…May 6May 6
Published inParallel & Distributed Computing For Data EnthusiastsHigh-Performance Network Fabrics and LibrariesNot long ago, high-performance network fabrics like InfiniBand, HPE Slingshot, or Intel Omni-Path (now CORNELIS) were primarily found in…Aug 26Aug 26
Published inParallel & Distributed Computing For Data EnthusiastsThe Three Core Concepts behind Data FrameworksThe three most important aspect of a data framework that defines its capabilities.Feb 6, 2023Feb 6, 2023