-1

I am deploying my program in spark cluster and I need to give each node a specific list of data that I decide on. How can I do this? I created an RDD object out of my data but I don't know how to pass the specific part of data to each node.

Ley Big
  • 1
  • 1

2 Answers2

0

I don't think, you can pass a specific list to the node. If your data have unique keys, then you can use hash technique to send same keys on specific partition

0

Not possible as you have no control which Worker Nodes are allocated, and, N Executors may be on same Worker Node.

thebluephantom
  • 11,806
  • 6
  • 26
  • 54