[GLUTEN-5320][VL] Reduce driver memory footprint by postpone the creation and serialization of LocalFilesNode #5321

WangGuangxin · 2024-04-08T08:31:57Z

What changes were proposed in this pull request?

Currently, driver generate GlutenPartition based on spark's FilePartitions, and then convert to LocalFilesNode and serialized to byte array in pb format.
This will double the driver memory, because the FilePartitions are not destroyed after convert to LocalFilesNodes.
When there are many file splits ( file status) , the impact is significant.

For example, in one of our case, there are total 48 hdfs paths to list, 7039474 files under them. With vanilla spark, it can work with driver memory = 20G, but failed in Gluten.

From the gc log, we can find that Gluten has more String and Byte[] objects than vanilla spark.

Vanilla Spark Full GC objects

 num     #instances         #bytes  class name
----------------------------------------------
   1:      42535479     8856286272  [C
   2:      42538104     1020914496  java.lang.String
   3:       7044015      563521200  java.net.URI
   4:       7039474      506842128  org.apache.hadoop.fs.LocatedFileStatus
   5:         13412      332304008  [B
   6:       7039474      281578960  org.apache.spark.sql.execution.datasources.PartitionedFile
   7:       7040016      225280512  scala.collection.mutable.LinkedHashSet$Entry
   8:       7039542      225265344  scala.collection.mutable.LinkedEntry
   9:       7039479      225263328  org.apache.hadoop.fs.permission.FsPermission
  10:          1412      151374272  [Lscala.collection.mutable.HashEntry;
  11:           145      125501688  [Lorg.apache.hadoop.fs.FileStatus;
  12:       7039625      112634000  org.apache.hadoop.fs.Path
  13:         55673       42854960  [Ljava.lang.Object;
  14:        146968       30759312  [Lorg.apache.spark.sql.execution.datasources.PartitionedFile;
  15:          2462       27069520  [J
  16:       1004712       24113088  java.util.concurrent.ConcurrentSkipListMap$Node
  17:        146968       16460416  org.apache.spark.scheduler.ResultTask
  18:        791929       12670864  scala.Some

Gluten Full GC objects (before this patch)

num     #instances         #bytes  class name
----------------------------------------------
   1:      70600217     9596405088  [C
   2:        153749     2117256784  [B
   3:      70603033     1694472792  java.lang.String
   4:      28210146      902724672  java.util.HashMap$Node
   5:       7056556      564282560  [Ljava.util.HashMap$Node;
   6:       7044001      563520080  java.net.URI
   7:       7039474      506842128  org.apache.hadoop.fs.LocatedFileStatus
   8:       7054771      338629008  java.util.HashMap
   9:       7039496      225263872  scala.collection.mutable.LinkedEntry
  10:       7039479      225263328  org.apache.hadoop.fs.permission.FsPermission
  11:       7040463      168971112  java.lang.Long
  12:        777126      135040840  [Ljava.lang.Object;
  13:       7039578      112633248  org.apache.hadoop.fs.Path
  14:          1332       67224064  [Lscala.collection.mutable.HashEntry;
  15:            97       56405176  [Lorg.apache.hadoop.fs.FileStatus;
  16:        748173       17956152  java.util.ArrayList
  17:        593611       14246664  scala.collection.immutable.$colon$colon
  18:          1919        9036728  [J

Gluten Full GC objects (after this patch)

num     #instances         #bytes  class name
----------------------------------------------
   1:      50009922    11752807376  [C
   2:      49812651     1195503624  java.lang.String
   3:       7043968      563517440  java.net.URI
   4:       7039474      506842128  org.apache.hadoop.fs.LocatedFileStatus
   5:       7039474      394210544  org.apache.spark.util.HadoopFSUtils$SerializableFileStatus
   6:         26766      259720056  [B
   7:       7039479      225263328  org.apache.hadoop.fs.permission.FsPermission
   8:       7039572      112633152  org.apache.hadoop.fs.Path
   9:         45775       68452656  [Ljava.lang.Object;
  10:       1573313       50346016  scala.collection.mutable.LinkedHashSet$Entry
  11:          1304       33665792  [Lscala.collection.mutable.HashEntry;
  12:         14435       15252040  [I
  13:            13        6756208  [Lorg.apache.hadoop.fs.FileStatus;
  14:        167935        5373920  java.util.concurrent.ConcurrentHashMap$Node
  15:        122916        3933312  java.util.Hashtable$Entry
  16:         31958        3531872  java.lang.Class
  17:         97118        3107776  scala.collection.mutable.ArrayBuilder$ofRef
  18:         97117        3107744  java.net.URI$Parser

(Fixes: #5320)

github-actions · 2024-04-08T08:32:13Z

#5320

github-actions · 2024-04-08T08:32:31Z

Run Gluten Clickhouse CI

WangGuangxin · 2024-04-08T08:52:36Z

There are still some cases to fix, for example:

velox backend with iceberg format
clickhouse backend(which is not planed in this PR)
But it works now on velox backend with parquet/orc format.
Appreciate your comments in advance if you have some concerns about the interface change. cc @zhztheplayer @Yohahaha @ulysses-you @liujiayi771

liujiayi771 · 2024-04-08T09:00:41Z

backends-velox/src/main/scala/org/apache/gluten/backendsapi/velox/IteratorApiImpl.scala

@@ -44,6 +42,7 @@ import org.apache.spark.util.ExecutorManager
 import java.lang.{Long => JLong}
 import java.nio.charset.StandardCharsets
 import java.time.ZoneOffset
+import java.util


Not introduce this package. Just use JArrayList.

Yohahaha · 2024-04-08T13:13:14Z

gluten-core/src/main/java/org/apache/gluten/substrait/rel/RawSplitInfo.java

+  public List<String> preferredLocations() {
+    return Arrays.asList(filePartition.preferredLocations());
+  }


val preferredLocations = SoftAffinity.getFilePartitionLocations(f)

please keep origin logic.

Yohahaha · 2024-04-08T13:20:19Z

gluten-core/src/main/scala/org/apache/gluten/backendsapi/IteratorApi.scala

@@ -91,4 +91,6 @@ trait IteratorApi {
      numOutputRows: SQLMetric,
      numOutputBatches: SQLMetric,
      scanTime: SQLMetric): RDD[ColumnarBatch]
+
+  def toLocalFilesNodeByteArray(p: GlutenRawPartition): Array[Array[Byte]]


could we add a new SplitInfo object file and move this method into it with toSplitInfoByteArray? then other backends could use it more easily, and avoid add this method in IteratorApi which seems unrelated.

Yohahaha · 2024-04-08T13:23:46Z

thank you for the improvements, this idea works for me, just few comments.

github-actions · 2024-05-24T01:45:39Z

This PR is stale because it has been open 45 days with no activity. Remove stale label or comment or this will be closed in 10 days.

github-actions · 2024-06-03T01:48:09Z

This PR was auto-closed because it has been stalled for 10 days with no activity. Please feel free to reopen if it is still valid. Thanks.

Yohahaha · 2024-06-03T01:55:20Z

@WangGuangxin are you still working on this PR?

WangGuangxin · 2024-06-03T23:37:27Z

@WangGuangxin are you still working on this PR?

@Yohahaha I'll rework on this this week.

Yohahaha · 2024-07-24T06:14:45Z

Hi @WangGuangxin
Since the current PR has not been updated for a while, I have submitted a new PR based on your code, addressing conflicts and comments. You can find it here #6572.

Feel free to request close my PR if yours is ready to review.

Reduce driver memory footprint

428f351

liujiayi771 reviewed Apr 8, 2024

View reviewed changes

Yohahaha reviewed Apr 8, 2024

View reviewed changes

github-actions bot added the stale stale label May 24, 2024

github-actions bot closed this Jun 3, 2024

Yohahaha mentioned this pull request Jul 24, 2024

[GLUTEN-5320][VL] Reduce driver memory footprint by postpone the creation and serialization of LocalFilesNode #6572

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[GLUTEN-5320][VL] Reduce driver memory footprint by postpone the creation and serialization of LocalFilesNode #5321

[GLUTEN-5320][VL] Reduce driver memory footprint by postpone the creation and serialization of LocalFilesNode #5321

WangGuangxin commented Apr 8, 2024

github-actions bot commented Apr 8, 2024

github-actions bot commented Apr 8, 2024

WangGuangxin commented Apr 8, 2024

liujiayi771 Apr 8, 2024

Yohahaha Apr 8, 2024

Yohahaha Apr 8, 2024

Yohahaha commented Apr 8, 2024

github-actions bot commented May 24, 2024

github-actions bot commented Jun 3, 2024

Yohahaha commented Jun 3, 2024

WangGuangxin commented Jun 3, 2024

Yohahaha commented Jul 24, 2024

[GLUTEN-5320][VL] Reduce driver memory footprint by postpone the creation and serialization of LocalFilesNode #5321

[GLUTEN-5320][VL] Reduce driver memory footprint by postpone the creation and serialization of LocalFilesNode #5321

Conversation

WangGuangxin commented Apr 8, 2024

What changes were proposed in this pull request?

github-actions bot commented Apr 8, 2024

github-actions bot commented Apr 8, 2024

WangGuangxin commented Apr 8, 2024

liujiayi771 Apr 8, 2024

Choose a reason for hiding this comment

Yohahaha Apr 8, 2024

Choose a reason for hiding this comment

Yohahaha Apr 8, 2024

Choose a reason for hiding this comment

Yohahaha commented Apr 8, 2024

github-actions bot commented May 24, 2024

github-actions bot commented Jun 3, 2024

Yohahaha commented Jun 3, 2024

WangGuangxin commented Jun 3, 2024

Yohahaha commented Jul 24, 2024