Google Cloud Dataflow SDK
Written by Kay Ewbank   
Monday, 05 January 2015

Google has released a Java software development kit (SDK) for Cloud Dataflow, its cloud analytics system launched in 2014.

 

Announcing the SDK on the Google Cloud Platform blog, Sam McVeety, Software Engineer at Google said that the SDK will make it “easier for developers to integrate with our managed service while also forming the basis for porting Cloud Dataflow to other languages and execution environments.”

The blog post says Google hopes that the developer community will create apps that combine stream and batch based processing models. The Cloud Dataflow SDK introduces a unified model for this, along with a set of windowing primitives that mean the same computations can be used with batch or stream based data sources.

Google also hopes developers will use the SDK to execute Dataflow on other service environments, commenting that “As Storm, Spark, and the greater Hadoop family continue to mature, developers are challenged with bifurcated programming models. We hope to relieve developer fatigue and enable choice in deployment platforms by supporting execution and service portability.”

Google also plans to add support for other languages in SDKs, and is already working on a Python 3 version. The Java version can be downloaded from GitHub.

 

googlecloudsq

More Information

Google Announces Open-Source Cloud Dataflow SDK for Java

Cloud Dataflow

DataflowJavaSDK on GitHub

Related Articles

Google Moves On From MapReduce, Launches Cloud Dataflow

 

To be informed about new articles on I Programmer, install the I Programmer Toolbar, subscribe to the RSS feed, follow us on, Twitter, FacebookGoogle+ or Linkedin,  or sign up for our weekly newsletter.

 

Banner


Rust Is A Top Twenty Language
03/06/2020

Yes, yes it's all nonsense, but it's fun and has some sort of correlation with something and if you are a Rust fan, count me in, you can use it to browbeat others - Rust is in TIOBE's top twenty langu [ ... ]



TileDB Improves Sparse Array Support
25/05/2020

Designed to give data scientists a more powerful way to store, update, analyze, and share large sets of data, TileDB lets you model data as either a dense or a sparse multi-dimensional array. New [ ... ]


More News

 

graphics

 



 

Comments




or email your comment to: comments@i-programmer.info

Last Updated ( Monday, 05 January 2015 )