
[VL] window function error #6600

Closed

FelixYBW opened this issue Jul 26, 2024 · 11 comments · Fixed by #6653

@FelixYBW
Contributor

Backend

VL (Velox)

Bug description

24/07/26 00:58:09 ERROR [Driver] datasources.FileFormatWriter: Aborting job c89ecb16-71d7-49b6-8963-cd58dd9873e1.
org.apache.spark.sql.catalyst.analysis.UnresolvedException: Invalid call to dataType on unresolved object
	at org.apache.spark.sql.catalyst.analysis.UnresolvedAttribute.dataType(unresolved.scala:160)
	at org.apache.spark.sql.types.StructType$.$anonfun$fromAttributes$1(StructType.scala:548)
	at scala.collection.TraversableLike.$anonfun$map$1(TraversableLike.scala:238)
	at scala.collection.Iterator.foreach(Iterator.scala:941)
	at scala.collection.Iterator.foreach$(Iterator.scala:941)
	at scala.collection.AbstractIterator.foreach(Iterator.scala:1429)
	at scala.collection.IterableLike.foreach(IterableLike.scala:74)
	at scala.collection.IterableLike.foreach$(IterableLike.scala:73)
	at scala.collection.AbstractIterable.foreach(Iterable.scala:56)
	at scala.collection.TraversableLike.map(TraversableLike.scala:238)
	at scala.collection.TraversableLike.map$(TraversableLike.scala:231)
	at scala.collection.AbstractTraversable.map(Traversable.scala:108)

The written columns are all string and bigint, except one date column, which is used as the partition column.

@JkSelf Is the date column the reason?

Spark version

Spark-3.2.x

Spark configurations

No response

System information

No response

Relevant logs

No response

@FelixYBW added the bug ("Something isn't working") and triage labels Jul 26, 2024
@LoseYSelf

What is the SQL and the table metadata?

@FelixYBW changed the title from "[VL] parquet write error" to "[VL] window function error" Jul 27, 2024
@FelixYBW
Contributor Author

The error is caused by a window function:

sum(a) OVER (
  PARTITION BY b, c
  ORDER BY date
  RANGE BETWEEN 6 PRECEDING AND CURRENT ROW
)
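
A minimal self-contained reproducer sketch in Spark/Scala along these lines (the column and table names are taken from the query later in this thread; the column types are assumptions based on the description above, not the real schema, and the session is assumed to have the Gluten/Velox plugin enabled):

import org.apache.spark.sql.SparkSession

// Hypothetical reproducer; assumes a Spark session configured with the
// Gluten/Velox plugin so the window operator is offloaded.
val spark = SparkSession.builder()
  .appName("window-range-date-repro")
  .getOrCreate()

import spark.implicits._

// Schema guessed from the description: strings and bigints plus one date column.
Seq(
  (1L, "feed", 10L, "2024-07-01"),
  (1L, "feed", 20L, "2024-07-03")
).toDF("user_id", "surface", "impression_count", "date_str")
  .selectExpr("user_id", "surface", "impression_count",
    "CAST(date_str AS DATE) AS date")
  .createOrReplaceTempView("d233")

// The window query from this thread: a RANGE frame ordered by a DATE column.
spark.sql("""
  SELECT sum(impression_count) OVER (
    PARTITION BY user_id, surface
    ORDER BY date
    RANGE BETWEEN 6 PRECEDING AND CURRENT ROW
  ) AS impression_count_last7days
  FROM d233
""").show()

This runs fine on vanilla Spark (date + integer bounds are supported there); whether it trips the error above depends on the backend actually picking up the window offload.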

@zml1206
Contributor

zml1206 commented Jul 30, 2024

Can you assign this issue to me? I'd like to try to solve it. Thank you. @FelixYBW

@JkSelf
Contributor

JkSelf commented Jul 30, 2024

I have disabled the date type in #6637. @zml1206, if you have the time, you are welcome to continue supporting the date type in the window range frame. Thank you for your contributions.
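
For readers following along: "disabling" here means falling back to vanilla Spark when a RANGE frame orders by a date column. A simplified sketch of that kind of validation check (illustrative only; the helper name is hypothetical and this is not the actual change in #6637):

import org.apache.spark.sql.catalyst.expressions.{Expression, RangeFrame, SortOrder, SpecifiedWindowFrame}
import org.apache.spark.sql.types.DateType

// Illustrative validation helper: report a RANGE frame ordered by a DATE
// column as unsupported so the planner falls back to vanilla Spark for it.
def supportedRangeFrame(orderSpec: Seq[SortOrder], frame: Expression): Boolean =
  frame match {
    case f: SpecifiedWindowFrame if f.frameType == RangeFrame =>
      // A RANGE frame with literal bounds sorts on a single expression;
      // reject it when that expression is date-typed.
      !orderSpec.exists(_.child.dataType == DateType)
    case _ => true // ROWS frames and unbounded frames are unaffected
  }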

@FelixYBW
Contributor Author

@JkSelf Is there a dependency on your current streaming window PR?

@FelixYBW
Contributor Author

@JkSelf Looks like the issue isn't fixed by #6600.

@zml1206
Contributor

zml1206 commented Aug 10, 2024

The UT verification here has passed. Are there any special cases that might have been missed? Can you provide the test SQL? @FelixYBW

@FelixYBW
Contributor Author

@JkSelf Did you verify the SQL I shared? It failed in my test.

@JkSelf
Contributor

JkSelf commented Aug 12, 2024

@FelixYBW I tested the fix in my local environment and confirmed that the issue has been resolved. However, I encountered the following exception when executing:

SELECT sum(impression_count_last7days) FROM (
  SELECT sum(impression_count) OVER (
    PARTITION BY user_id, surface
    ORDER BY date
    RANGE BETWEEN 6 PRECEDING AND CURRENT ROW
  ) AS impression_count_last7days
  FROM d233
)

24/08/12 16:13:26 WARN TaskSetManager: Lost task 86.1 in stage 3.0 (TID 263) (sr246 executor 8): TaskKilled (Stage cancelled: Job aborted due to stage failure: Task 90 in stage 3.0 failed 4 times, most recent failure: Lost task 90.3 in stage 3.0 (TID 254) (sr246 executor 6): java.lang.NullPointerException
        at org.apache.gluten.expression.ExpressionMappings$.expressionsMap(ExpressionMappings.scala:341)
        at org.apache.gluten.expression.ExpressionConverter$.replaceWithExpressionTransformer(ExpressionConverter.scala:54)
        at org.apache.gluten.expression.ExpressionConverter.replaceWithExpressionTransformer(ExpressionConverter.scala)
        at org.apache.gluten.substrait.expression.WindowFunctionNode.setBound(WindowFunctionNode.java:102)
        at org.apache.gluten.substrait.expression.WindowFunctionNode.toProtobuf(WindowFunctionNode.java:180)
        at org.apache.gluten.substrait.rel.WindowRelNode.toProtobuf(WindowRelNode.java:77)
        at org.apache.gluten.substrait.rel.ProjectRelNode.toProtobuf(ProjectRelNode.java:69)
        at org.apache.gluten.substrait.rel.AggregateRelNode.toProtobuf(AggregateRelNode.java:89)
        at org.apache.gluten.substrait.plan.PlanNode.toProtobuf(PlanNode.java:74)
        at org.apache.gluten.backendsapi.velox.VeloxIteratorApi.genFinalStageIterator(VeloxIteratorApi.scala:238)
        at org.apache.gluten.execution.WholeStageZippedPartitionsRDD.$anonfun$compute$1(WholeStageZippedPartitionsRDD.scala:59)
        at org.apache.gluten.utils.Arm$.withResource(Arm.scala:25)
        at org.apache.gluten.metrics.GlutenTimeMetric$.millis(GlutenTimeMetric.scala:37)
        at org.apache.gluten.execution.WholeStageZippedPartitionsRDD.compute(WholeStageZippedPartitionsRDD.scala:46)
        at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:367)
        at org.apache.spark.rdd.RDD.iterator(RDD.scala:331)
        at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:52)
        at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:367)
        at org.apache.spark.rdd.RDD.iterator(RDD.scala:331)
        at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:52)
        at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:367)
        at org.apache.spark.rdd.RDD.iterator(RDD.scala:331)
        at org.apache.spark.shuffle.ShuffleWriteProcessor.write(ShuffleWriteProcessor.scala:59)
        at org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:104)
        at org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:54)
        at org.apache.spark.TaskContext.runTaskWithListeners(TaskContext.scala:166)
        at org.apache.spark.scheduler.Task.run(Task.scala:141)
        at org.apache.spark.executor.Executor$TaskRunner.$anonfun$run$4(Executor.scala:620)
        at org.apache.spark.util.SparkErrorUtils.tryWithSafeFinally(SparkErrorUtils.scala:64)
        at org.apache.spark.util.SparkErrorUtils.tryWithSafeFinally$(SparkErrorUtils.scala:61)
        at org.apache.spark.util.Utils$.tryWithSafeFinally(Utils.scala:94)
        at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:623)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
        at java.lang.Thread.run(Thread.java:750)
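
For what it's worth, the stack shows the NPE is thrown while the task serializes the plan to Substrait (WindowFunctionNode.setBound converting the frame bound), and ExpressionMappings.expressionsMap dereferences something that is apparently not initialized at that point on the executor. One plausible reading, sketched below with hypothetical names (this is not Gluten's actual code), is a registry populated during driver-side initialization but read executor-side:

// Illustrative only: a registry that is populated on the driver but read on
// executors reproduces this class of NullPointerException.
object HypotheticalExpressionRegistry {
  // Set during driver-side plugin initialization; still null on an executor
  // unless init() is run there too (or the field is made a lazy val).
  private var extraMappings: Map[String, String] = _

  def init(m: Map[String, String]): Unit = { extraMappings = m }

  def expressionsMap: Map[String, String] =
    Map("sum" -> "sum") ++ extraMappings // NPE here if init() never ran
}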

@zml1206 Do you have time to look at this issue? If not, I can follow up. Thanks.

@zml1206
Contributor

zml1206 commented Aug 12, 2024

Do you have time to look at this issue? If not, I can follow up. Thanks.

I'll try to reproduce and solve it. @JkSelf

@FelixYBW
Contributor Author

Fixed by #6803.
