Dryad

This is a research prototype of the Dryad and DryadLINQ data-parallel 
processing frameworks running on Hadoop YARN.  Dryad utilizes cluster 
services provided as part of Hadoop YARN to reliably execute 
distributed computations on a cluster of computers. DryadLINQ provides
the LINQ programming model for distributed data processing and leverages 
Dryad for reliable execution.

Dryad and DryadLINQ on YARN are still under active development.  

If you just want to run a sample DryadLINQ program, the simplest way to get started
is to follow the instructions at 
http://microsoftresearchsvc.github.io/Dryad/?topic=html/e992fd94-c956-481d-82e6-dbdf45daa722.htm

Requirements

Visual Studio 2012
An Azure HDInsight 3.0 cluster or a Windows YARN cluster composed of x64 machines

Building Dryad

1) Clone the Dryad git repository.
2) Set the DRYAD_HOME environment variable to the binary path
   (bin\Debug or bin\Release) under the directory Dryad was cloned to.
3) Use Visual Studio to open the Dryad solution file (Dryad.sln) located 
   in the root of the repository and build the solution.  The dependencies
   will be fetched from NuGet.
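
Step 2 is easy to get wrong, so a quick check before building can help. The
following is a minimal sketch in Python, not part of the Dryad repository. The
helper name check_dryad_home is hypothetical, and it assumes only what step 2
states: DRYAD_HOME must be set and must end in bin\Debug or bin\Release.

```python
import os
from pathlib import PureWindowsPath

# Build configurations named in step 2 (compared case-insensitively).
VALID_CONFIGS = {"debug", "release"}


def check_dryad_home(env=None):
    """Return a list of problems with DRYAD_HOME (empty list means it looks fine).

    Hypothetical helper for illustration; it only inspects the path string
    and does not verify that the directory exists.
    """
    env = os.environ if env is None else env
    value = env.get("DRYAD_HOME", "").strip()
    if not value:
        return ["DRYAD_HOME is not set"]
    # PureWindowsPath parses backslash-separated paths on any host OS.
    parts = [p.lower() for p in PureWindowsPath(value).parts]
    if len(parts) < 2 or parts[-2] != "bin" or parts[-1] not in VALID_CONFIGS:
        return ["DRYAD_HOME should end in bin\\Debug or bin\\Release: " + value]
    return []


if __name__ == "__main__":
    problems = check_dryad_home()
    print("\n".join(problems) if problems else "DRYAD_HOME looks OK")
```

Running the script from the same command prompt used to launch Visual Studio
shows whether the variable is visible to that session.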


Notes

The HDFS implementation in Dryad currently supports only text files, although Azure Blobs can
be used to store any serialized data type.
