Skip to content

Can Khiops be deployed on Hadoop? #520

Answered by lucaurelien
lucaurelien asked this question in Q&A
Discussion options

You must be logged in to vote

Yes, Khiops has been used for several years in production on Hadoop using specialized tools we have developed. However, these components are not yet part of its open-source distribution.

Overview of the technical solution

We have developed the following components to enable Khiops integration with Hadoop ecosystems:

  • HDFS driver for Khiops: A dedicated driver enabling Khiops to read and write directly from HDFS, with full support for multipart files;
  • Khiops4Spark: A Spark wrapper that enables Khiops to run on Hadoop clusters through spark-submit command;
  • Khiops4Yarn: A Yarn wrapper that allows Khiops to operate directly within Hadoop’s resource management framework.

Khiops, as a native M…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by bruno-at-orange
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
1 participant