# doris_benchmark

**Repository Path**: w_fsm/doris_benchmark

## Basic Information

- **Project Name**: doris_benchmark
- **Description**: Doris data stress testing
- **Default Branch**: master
- **Created**: 2025-03-20
- **Last Updated**: 2026-02-13

## README

#### Introduction

Doris data stress testing.

#### Usage

1. Create a table

   Generates a table with the specified number of columns. A unique table is generated by default; a duplicate table can be generated instead.
   We recommend redirecting the output to a .sql file, then loading it from the Doris shell console with the `source` command.

   ```
   Generate create table SQL

   Usage:
     doris-mock-data create [flags]

   Flags:
         --batch-name string   Batch name for table and data generation (default "20250320")
         --extra-columns int   Number of extra columns in the table (default 1)
     -h, --help                help for create
         --mode string         table type,such as unique or duplicate (default "unique")
   ```

2. Batch-create tables

   ```
   Batch create tables

   Usage:
     doris-mock-data batch-create [flags]

   Flags:
         --batch int           Number of batches to create (default 1)
         --extra-columns int   Number of extra columns in the table (default 1)
     -h, --help                help for batch-create
         --mode string         table type,such as unique or duplicate (default "unique")
   ```

3. Generate mock data. The number of files is the total number of write batches, and the number of lines in each file is the number of records written per batch. For the unique table model, you can set a data duplication rate to simulate update scenarios.

   ```
   Generate mock data

   Usage:
     doris-mock-data mock [flags]

   Flags:
         --batch-name string               Batch name for table and data generation (default "20250320")
         --extra-columns int               Number of extra columns in the table (default 1)
         --file-count int                  Number of files to split the generated data into (default 5)
     -h, --help                            help for mock
         --record-per-file int             Number of records to generate in each file (default 1000)
         --uk-id-duplication-rate float    Duplication rate for uk_id column (default 0.2)
   ```

4. Batch-generate mock data, for scenarios that import into multiple tables.

   ```
   Batch generate mock data

   Usage:
     doris-mock-data batch-mock [flags]

   Flags:
         --batch int                       Number of batches to generate (default 1)
         --extra-columns int               Number of extra columns in the table (default 1)
         --file-count int                  Number of files to split the generated data into (default 5)
     -h, --help                            help for batch-mock
         --record-per-file int             Number of records to generate in each file (default 1000)
         --uk-id-duplication-rate float    Duplication rate for uk_id column (default 0.2)
   ```

5. Import data. Imports the files produced by mock data generation; the concurrency level is configurable.

   ```
   Import data to Doris

   Usage:
     doris-mock-data import [flags]

   Flags:
         --batch-name string   Batch name for table and data generation (default "20250320")
         --concurrency int     Concurrency level for data import (default 3)
         --db string           db name (default "test")
     -h, --help                help for import
         --ip string           Doris IP address (default "127.0.0.1")
         --password string     Doris password
         --port string         Doris port (default "8030")
         --user string         Doris username (default "root")
   ```

6. Batch-import data, for multi-table import scenarios.

   ```
   Batch import data to Doris

   Usage:
     doris-mock-data batch-import [flags]

   Flags:
         --batch int           Number of batches to import (default 1)
         --concurrency int     Concurrency level for data import (default 3)
         --db string           db name (default "test")
     -h, --help                help for batch-import
         --ip string           Doris IP address (default "127.0.0.1")
         --password string     Doris password
         --port string         Doris port (default "8030")
         --user string         Doris username (default "root")
   ```

7. Data quality check for a single-table import

   ```
   Check data quality

   Usage:
     doris-mock-data check [flags]

   Flags:
         --batch-name string   Batch name for table and data generation (default "20250320")
         --db string           db name (default "test")
     -h, --help                help for check
         --ip string           Doris IP address (default "127.0.0.1")
         --mode string         table type,such as unique or duplicate (default "unique")
         --password string     Doris password
         --port string         Doris port (default "9030")
         --user string         Doris username (default "root")
   ```

8. Data quality check for multi-table imports

   ```
   Batch check data quality

   Usage:
     doris-mock-data batch-check [flags]

   Flags:
         --batch int           Number of batches to check (default 1)
         --db string           db name (default "test")
     -h, --help                help for batch-check
         --ip string           Doris IP address (default "127.0.0.1")
         --mode string         table type,such as unique or duplicate (default "unique")
         --password string     Doris password
         --port string         Doris port (default "9030")
         --user string         Doris username (default "root")
   ```
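
The single-table commands are meant to be run in sequence (create, mock, import, check). The following is a minimal Python sketch that assembles those four invocations so they share one batch name and table mode. It uses only the flags and defaults shown in the help output; the helper name `build_workflow` is hypothetical and not part of the tool, and the script only prints the commands rather than running them. Note that `import` defaults to port 8030 while `check` defaults to 9030, which presumably correspond to the Doris FE HTTP port and the MySQL-protocol query port.

```python
import shlex

BINARY = "doris-mock-data"  # command name taken from the usage text above


def build_workflow(batch_name="20250320", extra_columns=1, mode="unique",
                   file_count=5, record_per_file=1000, duplication_rate=0.2,
                   concurrency=3, ip="127.0.0.1", user="root", password="",
                   db="test"):
    """Return the create, mock, import and check commands as argument lists.

    Hypothetical helper: it only arranges flags documented in the help text.
    """
    # Connection flags shared by import and check; only the port differs.
    conn = ["--ip", ip, "--user", user, "--password", password, "--db", db]
    return [
        [BINARY, "create", "--batch-name", batch_name,
         "--extra-columns", str(extra_columns), "--mode", mode],
        [BINARY, "mock", "--batch-name", batch_name,
         "--extra-columns", str(extra_columns),
         "--file-count", str(file_count),
         "--record-per-file", str(record_per_file),
         "--uk-id-duplication-rate", str(duplication_rate)],
        [BINARY, "import", "--batch-name", batch_name,
         "--concurrency", str(concurrency), "--port", "8030", *conn],
        [BINARY, "check", "--batch-name", batch_name,
         "--mode", mode, "--port", "9030", *conn],
    ]


if __name__ == "__main__":
    # Dry run: print the commands instead of executing them.
    for cmd in build_workflow(batch_name="20250320", mode="unique"):
        print(shlex.join(cmd))
```

The commands are printed rather than chained because the SQL emitted by `create` still has to be redirected to a .sql file and loaded in the Doris shell with `source` before `import` can succeed.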