We all know that managing data within an organisation is difficult. Managing biological research data is particularly challenging due to the wide array of data formats and complex annotations. Without an effective data management solution, researchers struggle to access what they need when they need it, leading to project delays and even costly redundant experiments.
Lamin is an open platform for biological data. Files are kept on your storage platform of choice, and Lamin provides a data lakehouse that lets users query datasets at scale while tracing the lineage of data and code. A comprehensive annotation system means data for specific genes, conditions, cell types or other annotations can be easily retrieved using the Lamin Python package.
We partnered with Lamin to build LaminR, an R client for LaminDB. By wrapping the Lamin Python package, LaminR unlocks the Lamin platform for R users, giving them access to Lamin functionality through a familiar interface. Users can now retrieve and create data and annotations using either R or Python. This enables analysts to take advantage of the strengths of each language in multi-language workflows and improves collaboration between teams by providing access to a common data store. See this blog post by Tyler Burns on the Lamin website for an example of LaminR in action.
We have already seen the benefits of using Lamin and LaminR when developing computational workflows for clients, and we look forward to further collaborations with Lamin!