Support explain query when running dfbench with clickbench #13942

zhuqi-lucas · 2024-12-30T02:00:58Z

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Add the debug option for clickbench

Are these changes tested?

yes:

cargo run --release --bin dfbench clickbench  --query 35 --debug
   Compiling datafusion-benchmarks v44.0.0 (/Users/zhuqi/arrow-datafusion/benchmarks)
    Building [=======================> ] 352/353: dfbench(bin)

    Finished `release` profile [optimized] target(s) in 4m 51s
     Running `target/release/dfbench clickbench --query 35 --debug`
Running benchmarks with the following options: RunOpt { query: Some(35), common: CommonOpt { iterations: 3, partitions: None, batch_size: 8192, debug: true }, path: "benchmarks/data/hits.parquet", queries_path: "benchmarks/queries/clickbench/queries.sql", output_path: None }
Q35: SELECT "ClientIP", "ClientIP" - 1, "ClientIP" - 2, "ClientIP" - 3, COUNT(*) AS c FROM hits GROUP BY "ClientIP", "ClientIP" - 1, "ClientIP" - 2, "ClientIP" - 3 ORDER BY c DESC LIMIT 10;
Query 35 iteration 0 took 1186.0 ms and returned 10 rows
Query 35 iteration 1 took 1018.3 ms and returned 10 rows
Query 35 iteration 2 took 970.2 ms and returned 10 rows
+---------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| plan_type     | plan                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |
+---------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| logical_plan  | Sort: c DESC NULLS FIRST, fetch=10                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |
|               |   Projection: hits.ClientIP, hits.ClientIP - Int64(1), hits.ClientIP - Int64(2), hits.ClientIP - Int64(3), count(*) AS c                                                                                                                                                                                                                                                                                                                                                                                      |
|               |     Aggregate: groupBy=[[hits.ClientIP, __common_expr_1 AS hits.ClientIP - Int64(1), __common_expr_1 AS hits.ClientIP - Int64(2), __common_expr_1 AS hits.ClientIP - Int64(3)]], aggr=[[count(Int64(1)) AS count(*)]]                                                                                                                                                                                                                                                                                         |
|               |       Projection: CAST(hits.ClientIP AS Int64) AS __common_expr_1, hits.ClientIP                                                                                                                                                                                                                                                                                                                                                                                                                              |
|               |         TableScan: hits projection=[ClientIP]                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |
| physical_plan | SortPreservingMergeExec: [c@4 DESC], fetch=10                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |
|               |   SortExec: TopK(fetch=10), expr=[c@4 DESC], preserve_partitioning=[true]                                                                                                                                                                                                                                                                                                                                                                                                                                     |
|               |     ProjectionExec: expr=[ClientIP@0 as ClientIP, hits.ClientIP - Int64(1)@1 as hits.ClientIP - Int64(1), hits.ClientIP - Int64(2)@2 as hits.ClientIP - Int64(2), hits.ClientIP - Int64(3)@3 as hits.ClientIP - Int64(3), count(*)@4 as c]                                                                                                                                                                                                                                                                    |
|               |       AggregateExec: mode=FinalPartitioned, gby=[ClientIP@0 as ClientIP, hits.ClientIP - Int64(1)@1 as hits.ClientIP - Int64(1), hits.ClientIP - Int64(2)@2 as hits.ClientIP - Int64(2), hits.ClientIP - Int64(3)@3 as hits.ClientIP - Int64(3)], aggr=[count(*)]                                                                                                                                                                                                                                             |
|               |         CoalesceBatchesExec: target_batch_size=8192                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
|               |           RepartitionExec: partitioning=Hash([ClientIP@0, hits.ClientIP - Int64(1)@1, hits.ClientIP - Int64(2)@2, hits.ClientIP - Int64(3)@3], 14), input_partitions=14                                                                                                                                                                                                                                                                                                                                       |
|               |             AggregateExec: mode=Partial, gby=[ClientIP@1 as ClientIP, __common_expr_1@0 - 1 as hits.ClientIP - Int64(1), __common_expr_1@0 - 2 as hits.ClientIP - Int64(2), __common_expr_1@0 - 3 as hits.ClientIP - Int64(3)], aggr=[count(*)]                                                                                                                                                                                                                                                               |
|               |               ProjectionExec: expr=[CAST(ClientIP@0 AS Int64) as __common_expr_1, ClientIP@0 as ClientIP]                                                                                                                                                                                                                                                                                                                                                                                                     |
|               |                 ParquetExec: file_groups={14 groups: [[Users/zhuqi/arrow-datafusion/benchmarks/data/hits.parquet:0..1055712604], [Users/zhuqi/arrow-datafusion/benchmarks/data/hits.parquet:1055712604..2111425208], [Users/zhuqi/arrow-datafusion/benchmarks/data/hits.parquet:2111425208..3167137812], [Users/zhuqi/arrow-datafusion/benchmarks/data/hits.parquet:3167137812..4222850416], [Users/zhuqi/arrow-datafusion/benchmarks/data/hits.parquet:4222850416..5278563020], ...]}, projection=[ClientIP] |
|               |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
+---------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+

Are there any user-facing changes?

yes

zhuqi-lucas · 2024-12-30T02:03:12Z

cc @alamb This is a small change that adds explain support to dfbench. Thanks!

2010YOUY01 · 2024-12-30T06:44:48Z

Perhaps we can use an existing common option to print plans

datafusion/benchmarks/src/util/options.rs

Line 40 in ab69bb0

pub debug: bool,

The tpch benchmark uses this flag, but clickbench does not use it yet.

datafusion/benchmarks/src/tpch/run.rs

Lines 194 to 232 in ab69bb0

    
    async fn execute_query(
        &self,
        ctx: &SessionContext,
        sql: &str,
    ) -> Result<Vec<RecordBatch>> {
        let debug = self.common.debug;
        let plan = ctx.sql(sql).await?;
        let (state, plan) = plan.into_parts();

        if debug {
            println!("=== Logical plan ===\n{plan}\n");
        }

        let plan = state.optimize(&plan)?;
        if debug {
            println!("=== Optimized logical plan ===\n{plan}\n");
        }
        let physical_plan = state.create_physical_plan(&plan).await?;
        if debug {
            println!(
                "=== Physical plan ===\n{}\n",
                displayable(physical_plan.as_ref()).indent(true)
            );
        }
        let result = collect(physical_plan.clone(), state.task_ctx()).await?;
        if debug {
            println!(
                "=== Physical plan with metrics ===\n{}\n",
                DisplayableExecutionPlan::with_metrics(physical_plan.as_ref())
                    .indent(true)
            );
            if !result.is_empty() {
                // do not call print_batches if there are no batches as the result is confusing
                // and makes it look like there is a batch with no columns
                pretty::print_batches(&result)?;
            }
        }
        Ok(result)
    }
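For illustration, the pattern suggested here (reuse the shared `debug` flag and print the plan only when it is set) can be sketched without any DataFusion dependency. This is a minimal model, not the merged implementation: the `CommonOpt` and `RunOpt` structs below are trimmed, hypothetical stand-ins whose field names follow the options line printed in the PR description, and `run_query`, `execute`, and `explain` are invented names for the sketch. In the real benchmark the closures would presumably call into DataFusion (for example `ctx.sql(sql)` followed by `DataFrame::explain`), which is an assumption here.

```rust
// Hypothetical stand-ins for the benchmark option structs. Field names follow
// the `RunOpt { query, common: CommonOpt { iterations, debug, .. }, .. }` line
// in the PR description; everything else is trimmed for the sketch.
#[derive(Debug, Clone)]
struct CommonOpt {
    iterations: usize,
    debug: bool,
}

#[derive(Debug, Clone)]
struct RunOpt {
    query: Option<usize>,
    common: CommonOpt,
}

/// Runs `execute` once per iteration and, only when `common.debug` is set,
/// calls `explain` a single time afterwards. Returns the lines a caller
/// would print, so the behaviour is easy to check.
fn run_query<E, X>(opt: &RunOpt, sql: &str, mut execute: E, explain: X) -> Vec<String>
where
    E: FnMut(&str) -> usize,   // hypothetical: runs the query, returns the row count
    X: FnOnce(&str) -> String, // hypothetical: renders the plan for the query
{
    let id = opt.query.unwrap_or(0);
    let mut lines = Vec::new();
    for i in 0..opt.common.iterations {
        let rows = execute(sql);
        lines.push(format!("Query {id} iteration {i} returned {rows} rows"));
    }
    // The plan is produced after the timed iterations and only on request,
    // so it does not affect the measurements.
    if opt.common.debug {
        lines.push(explain(sql));
    }
    lines
}

fn main() {
    let opt = RunOpt {
        query: Some(35),
        common: CommonOpt { iterations: 3, debug: true },
    };
    for line in run_query(&opt, "SELECT 1", |_| 10, |sql| format!("plan for: {sql}")) {
        println!("{line}");
    }
}
```

Keeping the flag in the shared options struct means each benchmark only decides what to print, not how the option is parsed.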

zhuqi-lucas · 2024-12-30T07:46:18Z

Thank you @2010YOUY01 for the review and the good suggestion! Addressed in the latest commit.

alamb

Thank you @zhuqi-lucas and @2010YOUY01 -- looks good to me

FYI @XiangpengHao as I think you have also been using this benchmark program

Support explain query when running dfbench

b08e4ad

Address comments

e8f87c7

zhuqi-lucas changed the title ~~Support explain query when running dfbench~~ Support explain query when running dfbench with clickbench Dec 30, 2024

alamb approved these changes Dec 30, 2024


alamb merged commit 4d07579 into apache:main Dec 30, 2024
27 checks passed
