8

I am looking for a Dataflow / Concurrent Programming API for Java.
I know there's DataRush, but it's not free. What I'm interested in specifically is multicore data processing, and not distributed, which rules out MapReduce or Hadoop.
Any thoughts?
Thanks, Rollo

Yishai
  • 84,976
  • 26
  • 176
  • 250
Rollo Tomazzi
  • 3,020
  • 3
  • 26
  • 21

4 Answers4

7

You might try gpars; it apparently has implementations of data flow variables and streams in Java even though it is geared towards providing concurrent programming goodies for Groovy.

sirolf2009
  • 681
  • 7
  • 13
Cagatay
  • 1,342
  • 1
  • 12
  • 16
1

Might try the upcoming fork/join library which will (hopefully) be in Java 7 as part of the JSR 166y update.

Main project page: - http://gee.cs.oswego.edu/dl/concurrency-interest/index.html

Pointers to lots of links about what it is: - http://tech.puredanger.com/java7#jsr166

Alex Miller
  • 65,227
  • 26
  • 112
  • 160
0

Does the built in Java concurrent package meet your needs? It's a very nice package, built in ThreadPools, CopyOnWriteCollections, Executors, Future. We use it to process large volumns of data in thread pools.

Steve K
  • 18,892
  • 6
  • 48
  • 49
0

https://github.com/rfqu/df4j is simple but powerful dataflow library. If it lacks some desired features, they can be added easly. It can exploit java.concurrent.ExecutorService.

Alexei
  • 11
  • 1