02 Oct 22:49

chathurawidanage

bb1de35

Twister2 Release 0.8.0 Latest

Latest

Twister2 Release 0.8.0

This is a major release of Twister2.

You can download source code from Github

Features of this release

Fault Tolerance enhancements
Apache Beam integration is now official Beam Docs
Improvements to configurations
Major TSet API update

Minor features

Apart from these, we have done many code improvements and bug fixes.

Next Release

In the next release we are working to,

Improve and release Table API
TSQL; Adding SQL support

Components in Twister2

We support the following components in Twister2

Resource provisioning component to bring up and manage parallel workers in cluster environments
1. Standalone
2. Kubernetes
3. Mesos
4. Slurm
5. Nomad
Parallel and Distributed Operators in HPC and Cloud Environments
1. Twister2:Net - a data level dataflow operator library for streaming and large scale batch analysis
2. Harp - a BSP (Bulk Synchronous Processing) innovative collective framework for parallel applications and machine learning at message level
3. OpenMPI (HPC Environments only) at message level
Task System
1. Task Graph
  - Create dataflow graphs for streaming and batch analysis including iterative computations
2. Task Scheduler - Schedule the task graph into cluster resources supporting different scheduling algorithms
  - Datalocality Scheduling
  - Roundrobin scheduling
  - First fit scheduling
3. Executor - Execution of task graph
  - Batch executor
  - Streaming executor
TSet for distributed data representation (Similar to Spark RDD, Flink DataSet and Heron Streamlet)
1. Iterative computations
2. Data caching
APIs for streaming and batch applications
1. Operator API
2. Task Graph based API
3. TSet API
Support for storage systems
1. HDFS
2. Local file systems
3. NFS for persistent storage
Web UI for monitoring Twister2 Jobs
Apache Storm Compatibility API
Apache BEAM API
Connected DataFlow (Experimental)
1. Supports creation of multiple dataflow graphs executing in a single job

Assets 2

17 Jul 21:55

chathurawidanage

0.7.0

259fd7e

Twister2 Release 0.7.0

This is a major release of Twister2.

You can download source code from Github

Features of this release

Fault Tolerance enhancements; Automated fault detection and recovery
Table API(experimental)
TSet API improvements; Pipe capability and TSetEnvironement

Minor features

Apart from this, we have done many code improvements and bug fixes.

Next Release

In the next release, we are working to,

Improve and release Table API
TSQL; Adding SQL support

Components in Twister2

We support the following components in Twister2

Resource provisioning component to bring up and manage parallel workers in cluster environments
1. Standalone
2. Kubernetes
3. Mesos
4. Slurm
5. Nomad
Parallel and Distributed Operators in HPC and Cloud Environments
1. Twister2:Net - a data level dataflow operator library for streaming and large scale batch analysis
2. Harp - a BSP (Bulk Synchronous Processing) innovative collective framework for parallel applications and machine learning at message level
3. OpenMPI (HPC Environments only) at message level
Task System
1. Task Graph
  - Create dataflow graphs for streaming and batch analysis including iterative computations
2. Task Scheduler - Schedule the task graph into cluster resources supporting different scheduling algorithms
  - Datalocality Scheduling
  - Roundrobin scheduling
  - First fit scheduling
3. Executor - Execution of task graph
  - Batch executor
  - Streaming executor
TSet for distributed data representation (Similar to Spark RDD, Flink DataSet and Heron Streamlet)
1. Iterative computations
2. Data caching
APIs for streaming and batch applications
1. Operator API
2. Task Graph based API
3. TSet API
Support for storage systems
1. HDFS
2. Local file systems
3. NFS for persistent storage
Web UI for monitoring Twister2 Jobs
Apache Storm Compatibility API
Apache BEAM API
Connected DataFlow (Experimental)
1. Supports creation of multiple dataflow graphs executing in a single job

Assets 2

17 Feb 19:01

pulasthi

0.6.0

6b8373f

Twister2 Release 0.6.0

This is a major release of Twister2.

You can download source code from Github

Features of this release

Adds support required for the Twister2 Beam runner

Minor features

Apart from this, we have done many code improvements and bug fixes.

Next Release

In the next release, we are working to improve the Fault tolerance, fix issues and integrate with AI
systems.

Components in Twister2

We support the following components in Twister2

Resource provisioning component to bring up and manage parallel workers in cluster environments
1. Standalone
2. Kubernetes
3. Mesos
4. Slurm
5. Nomad
Parallel and Distributed Operators in HPC and Cloud Environments
1. Twister2:Net - a data level dataflow operator library for streaming and large scale batch analysis
2. Harp - a BSP (Bulk Synchronous Processing) innovative collective framework for parallel applications and machine learning at message level
3. OpenMPI (HPC Environments only) at message level
Task System
1. Task Graph
  - Create dataflow graphs for streaming and batch analysis including iterative computations
2. Task Scheduler - Schedule the task graph into cluster resources supporting different scheduling algorithms
  - Datalocality Scheduling
  - Roundrobin scheduling
  - First fit scheduling
3. Executor - Execution of task graph
  - Batch executor
  - Streaming executor
TSet for distributed data representation (Similar to Spark RDD, Flink DataSet and Heron Streamlet)
1. Iterative computations
2. Data caching
APIs for streaming and batch applications
1. Operator API
2. Task Graph based API
3. TSet API
Support for storage systems
1. HDFS
2. Local file systems
3. NFS for persistent storage
Web UI for monitoring Twister2 Jobs
Apache Storm Compatibility API
Apache BEAM API
Connected DataFlow (Experimental)
1. Supports creation of multiple dataflow graphs executing in a single job

Assets 2

08 Feb 14:55

chathurawidanage

0.5.0

e8755f7

Twister2 Release 0.5.0

This is a major release of Twister2.

You can download source code from Github

Features of this release

Support for UCX
Performance improvements of shuffle operations
Hash Join implementation
Windowing support for TSet API
CSV data readers

Minor features

Apart from these, we have done many code improvements and bug fixes.

Next Release

In the next release we are working to improve the Fault tolerance, fix issues and integrate with AI
systems.

Components in Twister2

We support the following components in Twister2

Resource provisioning component to bring up and manage parallel workers in cluster environments
1. Standalone
2. Kubernetes
3. Mesos
4. Slurm
5. Nomad
Parallel and Distributed Operators in HPC and Cloud Environments
1. Twister2:Net - a data level dataflow operator library for streaming and large scale batch analysis
2. Harp - a BSP (Bulk Synchronous Processing) innovative collective framework for parallel applications and machine learning at message level
3. OpenMPI (HPC Environments only) at message level
Task System
1. Task Graph
  - Create dataflow graphs for streaming and batch analysis including iterative computations
2. Task Scheduler - Schedule the task graph into cluster resources supporting different scheduling algorithms
  - Datalocality Scheduling
  - Roundrobin scheduling
  - First fit scheduling
3. Executor - Execution of task graph
  - Batch executor
  - Streaming executor
TSet for distributed data representation (Similar to Spark RDD, Flink DataSet and Heron Streamlet)
1. Iterative computations
2. Data caching
APIs for streaming and batch applications
1. Operator API
2. Task Graph based API
3. TSet API
Support for storage systems
1. HDFS
2. Local file systems
3. NFS for persistent storage
Web UI for monitoring Twister2 Jobs
Apache Storm Compatibility API
Apache BEAM API
Connected DataFlow (Experimental)
1. Supports creation of multiple dataflow graphs executing in a single job

Assets 2

04 Feb 19:41

chathurawidanage

0.5.0-rc1

e8755f7

Twister2 Release 0.5.0 RC1 Pre-release

Pre-release

This is a pre-release release of Twister2.

You can download source code from Github

Features of this release

Support for UCX
Performance improvements of shuffle operations
Hash Join implementation
Windowing support for TSet API
CSV data readers

Minor features

Apart from these, we have done many code improvements and bug fixes.

Next Release

In the next release we are working to improve the Fault tolerance, fix issues and integrate with AI
systems.

Components in Twister2

We support the following components in Twister2

Resource provisioning component to bring up and manage parallel workers in cluster environments
1. Standalone
2. Kubernetes
3. Mesos
4. Slurm
5. Nomad
Parallel and Distributed Operators in HPC and Cloud Environments
1. Twister2:Net - a data level dataflow operator library for streaming and large scale batch analysis
2. Harp - a BSP (Bulk Synchronous Processing) innovative collective framework for parallel applications and machine learning at message level
3. OpenMPI (HPC Environments only) at message level
Task System
1. Task Graph
  - Create dataflow graphs for streaming and batch analysis including iterative computations
2. Task Scheduler - Schedule the task graph into cluster resources supporting different scheduling algorithms
  - Datalocality Scheduling
  - Roundrobin scheduling
  - First fit scheduling
3. Executor - Execution of task graph
  - Batch executor
  - Streaming executor
TSet for distributed data representation (Similar to Spark RDD, Flink DataSet and Heron Streamlet)
1. Iterative computations
2. Data caching
APIs for streaming and batch applications
1. Operator API
2. Task Graph based API
3. TSet API
Support for storage systems
1. HDFS
2. Local file systems
3. NFS for persistent storage
Web UI for monitoring Twister2 Jobs
Apache Storm Compatibility API
Apache BEAM API
Connected DataFlow (Experimental)
1. Supports creation of multiple dataflow graphs executing in a single job

Assets 2

03 Dec 20:18

chathurawidanage

0.4.0

c904179

Twister2 Release 0.4.0

This is a major release of Twister2.

You can download source code from Github

Features of this release

In this release we moved to OpenMPI 4.0.1 and Python 3. Also we tested Twister2 with JDK 11.

Python API
Fully functioning TSet API
ZooKeeper based automatic restart of workers when failures happen
Improvements to performance including a new routing algorithm for shuffle operations
BEAM integration improvements

Minor features

Apart from these, we have done many code improvements and bug fixes.

Next Release

In the next release we are working to improve the Fault tolerance, fix issues and integrate with AI
systems.

Components in Twister2

We support the following components in Twister2

Resource provisioning component to bring up and manage parallel workers in cluster environments
1. Standalone
2. Kubernetes
3. Mesos
4. Slurm
5. Nomad
Parallel and Distributed Operators in HPC and Cloud Environments
1. Twister2:Net - a data level dataflow operator library for streaming and large scale batch analysis
2. Harp - a BSP (Bulk Synchronous Processing) innovative collective framework for parallel applications and machine learning at message level
3. OpenMPI (HPC Environments only) at message level
Task System
1. Task Graph
  - Create dataflow graphs for streaming and batch analysis including iterative computations
2. Task Scheduler - Schedule the task graph into cluster resources supporting different scheduling algorithms
  - Datalocality Scheduling
  - Roundrobin scheduling
  - First fit scheduling
3. Executor - Execution of task graph
  - Batch executor
  - Streaming executor
TSet for distributed data representation (Similar to Spark RDD, Flink DataSet and Heron Streamlet)
1. Iterative computations
2. Data caching
APIs for streaming and batch applications
1. Operator API
2. Task Graph based API
3. TSet API
Support for storage systems
1. HDFS
2. Local file systems
3. NFS for persistent storage
Web UI for monitoring Twister2 Jobs
Apache Storm Compatibility API
Apache BEAM API
Connected DataFlow (Experimental)
1. Supports creation of multiple dataflow graphs executing in a single job

Assets 2

27 Nov 21:28

chathurawidanage

0.4.0-rc1

90a7c60

Twister2 Release 0.4.0 RC1 Pre-release

Pre-release

Twister2 Release 0.4.0

This is a major release of Twister2.

You can download source code from Github

Features of this release

In this release we moved to OpenMPI 4.0.1 and Python 3. Also we tested Twister2 with JDK 11.

Python API
Fully functioning TSet API
ZooKeeper based automatic restart of workers when failures happen
Improvements to performance including a new routing algorithm for shuffle operations
BEAM integration improvements

Minor features

Apart from these, we have done many code improvements and bug fixes.

Next Release

In the next release we are working onto consolidate the Apache Beam integration and improve the
fault tolerance (automatic restart of wokers)

Components in Twister2

We support the following components in Twister2

Resource provisioning component to bring up and manage parallel workers in cluster environments
1. Standalone
2. Kubernetes
3. Mesos
4. Slurm
5. Nomad
Parallel and Distributed Operators in HPC and Cloud Environments
1. Twister2:Net - a data level dataflow operator library for streaming and large scale batch analysis
2. Harp - a BSP (Bulk Synchronous Processing) innovative collective framework for parallel applications and machine learning at message level
3. OpenMPI (HPC Environments only) at message level
Task System
1. Task Graph
  - Create dataflow graphs for streaming and batch analysis including iterative computations
2. Task Scheduler - Schedule the task graph into cluster resources supporting different scheduling algorithms
  - Datalocality Scheduling
  - Roundrobin scheduling
  - First fit scheduling
3. Executor - Execution of task graph
  - Batch executor
  - Streaming executor
TSet for distributed data representation (Similar to Spark RDD, Flink DataSet and Heron Streamlet)
1. Iterative computations
2. Data caching
APIs for streaming and batch applications
1. Operator API
2. Task Graph based API
3. TSet API
Support for storage systems
1. HDFS
2. Local file systems
3. NFS for persistent storage
Web UI for monitoring Twister2 Jobs
Apache Storm Compatibility API
Apache BEAM API
Connected DataFlow (Experimental)
1. Supports creation of multiple dataflow graphs executing in a single job

Assets 2

06 Sep 19:27

pulasthi

0.3.0

a74e185

Twister2 Release 0.3.0

This is a major release of Twister2.

You can download source code from Github

Features of this release

In this release we moved to OpenMPI 4.0.1 and Python 3. Also we tested Twister2 with JDK 11.

The initial version of Apache BEAM integration
Fully functioning TSet API
Simulator for writing applications with IDE
Organize the APIs to facilitate easy creation of applications
Improvements to performance including a new routing algorithm for shuffle operations
Improved batch task scheduler (new batch scheduler)
Inner joins and outer joins
Support for reading HDFS files through TSet API
The initial version of fault tolerance with manual restart
Configuration structure improvements
Nomad scheduler improvements
New documentation website

Minor features

Apart from these, we have done many code improvements and bug fixes.

Next Release

In the next release we are working onto consolidate the Apache Beam integration and improve the
fault tolerance (automatic restart of wokers)

Assets 2

29 Aug 20:07

chathurawidanage

0.3.0-rc2

87c6c66

Twister2 Release 0.3.0-rc2 Pre-release

Pre-release

This is a major release of Twister2.

You can download source code from Github

Features of this release

In this release we moved to OpenMPI 4.0.1 and Python 3. Also we tested Twister2 with JDK 11.

The initial version of Apache BEAM integration
Fully functioning TSet API
Simulator for writing applications with IDE
Organize the APIs to facilitate easy creation of applications
Improvements to performance including a new routing algorithm for shuffle operations
Improved batch task scheduler (new batch scheduler)
Inner joins and outer joins
Support for reading HDFS files through TSet API
The initial version of fault tolerance with manual restart
Configuration structure improvements
Nomad scheduler improvements
New documentation website

Minor features

Apart from these, we have done many code improvements and bug fixes.

Next Release

In the next release we are working onto consolidate the Apache Beam integration and improve the
fault tolerance (automatic restart of wokers)

Assets 2

24 Aug 04:07

pulasthi

0.3.0-rc1

e4b2d09

0.3.0-rc1 Pre-release

Pre-release

Twister2 Release 0.3.0-rc1

This is a major release of Twister2.

You can download source code from Github

Features of this release

In this release we moved to OpenMPI 4.0.1 and Python 3. Also we tested Twister2 with JDK 11.

The initial version of Apache BEAM integration
Fully functioning TSet API
Simulator for writing applications with IDE
Organize the APIs to facilitate easy creation of applications
Improvements to performance including a new routing algorithm for shuffle operations
Improved batch task scheduler (new batch scheduler)
Inner joins and outer joins
Support for reading HDFS files through TSet API
The initial version of fault tolerance with manual restart
Configuration structure improvements
Nomad scheduler improvements
New documentation website

Minor features

Apart from these, we have done many code improvements and bug fixes.

Next Release

In the next release we are working onto consolidate the Apache Beam integration and improve the
fault tolerance (automatic restart of wokers)

Components in Twister2

We support the following components in Twister2

Resource provisioning component to bring up and manage parallel workers in cluster environments
1. Standalone
2. Kubernetes
3. Mesos
4. Slurm
5. Nomad
Parallel and Distributed Operators in HPC and Cloud Environments
1. Twister2:Net - a data level dataflow operator library for streaming and large scale batch analysis
2. Harp - a BSP (Bulk Synchronous Processing) innovative collective framework for parallel applications and machine learning at message level
3. OpenMPI (HPC Environments only) at message level
Task System
1. Task Graph
  - Create dataflow graphs for streaming and batch analysis including iterative computations
2. Task Scheduler - Schedule the task graph into cluster resources supporting different scheduling algorithms
  - Datalocality Scheduling
  - Roundrobin scheduling
  - First fit scheduling
3. Executor - Execution of task graph
  - Batch executor
  - Streaming executor
TSet for distributed data representation (Similar to Spark RDD, Flink DataSet and Heron Streamlet)
1. Iterative computations
2. Data caching
APIs for streaming and batch applications
1. Operator API
2. Task Graph based API
3. TSet API
Support for storage systems
1. HDFS
2. Local file systems
3. NFS for persistent storage
Web UI for monitoring Twister2 Jobs
Apache Storm Compatibility API
Apache BEAM API
Connected DataFlow (Experimental)
1. Supports creation of multiple dataflow graphs executing in a single job

Assets 2

Releases: cylondata/twister2