mostly java, python. some C/C++ machine learning focused. both web/text data and spatial/image data lot of distributed system work (Hadoop, Cluster computing, etc) linux