Published in Parallel & Distributed Computing For Data Enthusiasts
Life Cycle of a Parallel Application
Even though many types of hosting systems exist, including clouds, bare-metal clusters, and supercomputers, a distributed parallel data…
6d ago

Cluster Resources — Job Scheduling
A computer cluster consists of many computer nodes that are closely coupled together with a network. A computer cluster is used for…
Feb 1

Published in Parallel & Distributed Computing For Data Enthusiasts
Why Data Analytical Queries Scale Well
In a previous article, we discussed that data-intensive applications scale well but didn’t go into the details too much. In this article…
Jan 29

Published in Parallel & Distributed Computing For Data Enthusiasts
Scalability of IO-Intensive Applications
Modern computing applications can be divided into broader categories: IO-intensive and compute-intensive. As the name suggests…
Oct 15, 2024

Published in Parallel & Distributed Computing For Data Enthusiasts
Visual Guide to Distribution Patterns for Arrays in MPI, NCCL
When multiple processes are involved in a parallel computation, they must communicate periodically to synchronize the data. Many libraries…
Sep 16, 2024

Published in Parallel & Distributed Computing For Data Enthusiasts
Introduction to UCX Network Programming
Network programming is always a hassle; on the bright side, as developers, we rarely get to write network programs ourselves. In most…
Sep 20, 2024

Published in Parallel & Distributed Computing For Data Enthusiasts
Parquet, Orc, Avro, CSV and JSON
We encounter data in many different formats. Some common examples are CSV, JSON, XML, Text, and Binary types. Every such format has a…
Sep 5, 2024

Published in Parallel & Distributed Computing For Data Enthusiasts
Table Abstractions for Data Processing
Table abstractions are a great way to represent heterogeneous and structured data in memory. In data analytics, a table is a structured…
Aug 30, 2024

Published in Parallel & Distributed Computing For Data Enthusiasts
Arrays & Tensors for Data Processing
Vectors and matrices are two foundational structures in computations. A vector is an ordered collection of elements. A matrix is a…
Aug 25, 2024

Published in Parallel & Distributed Computing For Data Enthusiasts
In-Memory Data Structures for Data Operations
Every program deals with data structures. Whether it is a distributed program running on thousands of computers or a serial program running…
Aug 15, 2024