The Nix packages collection is large with over 60 000 packages. However, chances are that sometimes the package you need is not available. You must integrate it yourself. I needed for some fonts which… | Continue reading
When using an operating system, upgrading packages or installing new ones are common tasks that introduce the risk of affecting the stability of the system. NixOS is a Linux distribution that ensures… | Continue reading
Having your Kubernetes cluster up and running is just the start of your journey and you now need to operate. To secure its access, user identities must be declared along with authentication and… | Continue reading
At DockerCon EU 2018 in Barcelona, Matt Butcher, Principal Engineer at Microsoft and inventor of Helm, introduced CNAB, Cloud Native Application Bundles, a packaging format for distributed applications, along with Duffle, a CLI tool to run these bundles. The introduction came dur … | Continue reading
This article goes over the fundamental principles of Machine Learning and what tools are currently used to run machine learning algorithms. We will then see how a resource manager such as YARN can be useful in this context and how it can help the algorithms to run smoothly. This … | Continue reading
Let’s dive into the new features offered by the 2.3 distribution of Apache Spark. This article is a composition of the following talks seen at the DataWorks Summit 2018 and additional research: Apache Spark 2.3 boosts advanced analytics & deep learning by Yanbo Liang, Staff Softw … | Continue reading
Jesus Camacho Rodriguez from Hortonworks held a talk “Accelerating query processing with materialized views in Apache Hive” about the new materialized view feature coming in Apache Hive 3.0. This article will cover the main principle of this feature, give some examples and the im … | Continue reading
The integration of TensorFlow With Spark has a lot of potential and create new opportunities. I wrote this article in reaction to a conference I have watched during the DataWorks Summit 2018 in Berlin. This conference was about the new features of the 2.3 release of Apache Spark, … | Continue reading