This month: a roundup of the summer’s Kedro news, some release updates, and our top picks from recent articles.
Over the last few months, we’ve been happy to welcome some new team members to the Kedro and Kedro-Viz teams, who have also joined our Technical Steering Committee. Welcome Dmitry Sorokin, Jitendra Gundaniya, Laura Couto, Ravi Kumar Pilla, and Vladimir Nikolic!
We are also pleased to announce a Kedro baby, delivered safely by one of the team, at the end of July!
We reworked the Kedro contributors guide in August, and moved it to the Kedro wiki. There are loads of different ways to contribute to Kedro and if you want to get involved, we encourage you to look at the table that introduces the Kedro contributor guide.
There have been some amazing contributions in recent weeks, including the kedro-vineyard plugin for efficient intermediate sharing in Kedro pipelines, kedro-graphql for serving Kedro projects as a GraphQL API, and kedro-pandera to bring data validation to your Kedro projects.
August 2023 saw a set of releases to introduce Python 3.11 support across Kedro, Kedro-Viz and Kedro datasets.
Kedro version 0.18.13 included these major features and improvements:
Added support for Python 3.11.
OmegaConfigLoaderfeatures: registering of custom resolvers through
CONFIG_LOADER_ARGSand support for global variables.
kedro catalog resolveCLI command that resolves dataset factories in the catalog with any explicit entries in the project pipeline.
conffolder structure for modular pipelines and updated kedro pipeline create and kedro catalog create accordingly.
Made various updates to the Kedro project template and Kedro starters: use of OmegaConfigLoader, transition from
pyproject.toml, and updated for the simplified
Kedro Viz version 6.5 added support for Python 3.11, while Kedro Viz version 6.4 added two new features: feature hint cards to highlight key features of Kedro Viz and support for displaying dataset statistics in the metadata panel for further investigation.
Kedro Datasets version 1.7.0 added
polars.GenericDataSet, a dataset backed by polars, a lightning fast dataframe package built entirely using Rust. Kedro Datasets version 1.6.0 added support for Python 3.11.
In the last few weeks we’ve published the following on the Kedro blog:
We’re always looking for collaborators to write about their experiences using Kedro, particularly if you’re working with Kedro datasets or converting an existing project to use Kedro. Get in touch with us on our Slack workspace to tell us your story.
We really enjoyed reading more on Medium about the Kedro Vineyard plugin, which is a cloud-native data manager, for data sharing using memory in data science pipelines on Kubernetes.
Quix published an interesting article called “Bridging the gap between data scientists and engineers in machine learning workflows” which is something we regularly discuss within the team.
We found a super-interesting project about font recognition that uses Kedro.
And finally, we enjoyed reading more about data streaming with Kedro over on the QuantumBlack Medium channel.
That’s it for this edition!