anndata: annotated data in R

anndata is a commonly used Python package for keeping track of data and learned annotations, and can be used to read from and write to the h5ad file format. It is also the main data format used in the scanpy...

DiFlow

Developing and maintaining pipelines/workflows can be a genuine challenge. Doing this in a collaborative context adds even more to this complexity. We all dream of a flexible platform that allows us to easily express the computational requirements and is able...

A Practical Approach to Model Error - Part 2

It’s been a while since I first wrote about tackling model error using a simple model. It’s about time to come back to it. What triggered the current post is the opportunity I was given to give a masterclass in...

Viz and Math in Apache Zeppelin

For some years, I have been going back an forth between Spark-Notebook and Apache Zeppelin for different use-cases. Already 2 years ago, I made a little comparison of the two technologies. I used Spark-Notebook in order to do develop the...

Data Intuitive at Spark Summit 2016

Short version No, I’m not at the spark summit in Brussels. Time is not on my side, too much things to do. But, being there is not necessary when other people make sure they do the publicity for you. Thank...

Bullet Proof Data Science in Scala

In this post, we go over some typical aspects and challenges that occur in typical data science projects in order to extract some requirements for data analysis in the broad sense of the word. We then illustrate how we tackle...

A Practical Approach to Model Error

In this post, I want to get a better sense of the effects of model error by simulating a very simple model for the spreading of a virus. It’s based on an analysis performed by Nassim Taleb. I used the...

Thoughts on the Spark Summit Europe 2015

In this post, I summarize some of the things I picked up at the Spark Summit. Some of these require more thought or research, but at least I’ll have a to do list of things to look back at. In...

New site after two weeks

I’ve been running for about 2 weeks on Jekyll now. I spent about 2 hours during that week cleaning out some glitches and doing some more tuning. Permalinks The permalinks where not as I wanted them to be, and they...

New website location and layout

I’ve long been tempted to recreate my website using Jekyll. I had already been experimenting with it for a course website. But now I finally made the switch for my homepage. For the migration of the content from Wordpress to...