[rust] tokioで様々なasync/awaitの使い方を試してみる

前回のasync/awaitを使ったコードが非常に遅かったので様々な使い方を試してどうすれば早く処理できるか探ってみます。

tokioの基本動作

そもそも適当にasync/awaitなコードを書いたので根本のtokioのランタイム動作から学び直してみます。そこから得られた理解としては、単純にasyncな関数を書いてもその処理は別のスレッドが処理してくれるわけではない、ということ。

tokio::task::spawnなどを使用しないとでスケジューラーがFutureを別スレッドへ割り当ててくれないので効率的に処理してくれない、ということらしいです。

スレッドが配達員、Futureが荷物、tokioランタイムが配達員を管理するマネージャーだとすると、重量の異なるそれぞれの荷物を配達員が目的地まで運ぶことに例えることができるかもしれません。

マネージャーが配達員をアサインして配達員が担当する荷物を運ぶ。しかし、荷物は一人の配達員が終着まで運ぶのではなく途中交代される可能性がある。荷物の配達員の負荷を見てマネージャーがそのあたりを調整する。無理矢理シーケンス図にするとこんな感じだろうか。

spawnは荷物の配送を依頼するようなもの、と理解するとわかりやすいかもしれません。依頼しないとシングルスレッドで動かすことと同じになるのでパフォーマンスが上がらない、というのがざっくりとした理解です。

ということで、spawnを使用して多くのFutureをランタイムへ送り込めば上手い具合に処理してくれるであろうという理解の元、様々なコードを試してみます。ちなみにマルチスレッドプログラミングもやったことはないので手探りです。

再帰処理を使用しないパターン

async関数の中からasync関数を呼ぶとデッドロックしそうなので再帰処理をしないコードで書き直してみました。グローバル変数にディレクトリ情報を管理させることでループ処理にしています。これを基本として様々な書き方を試していきたいと思います。

use anyhow::Result; // 1.0.71
use clap::Parser; // 4.3.11

use tokio::fs; // 1.32.0
use tokio::sync::{Mutex, OnceCell};

#[derive(Parser, Debug)]
#[command(author, version, about, long_about = None)]
struct Args {
    /// target directory
    #[arg(short, long, default_value_t = String::from(".") )]
    dir: String,
}

static LIST_FROM_THD: tokio::sync::OnceCell<tokio::sync::Mutex<Vec<String>>> = OnceCell::const_new();

#[tokio::main]
async fn main() -> Result<()> {
    let args = Args::parse();

    let dir_string = args.dir.to_string();
    println!("{}", dir_string);

    let _ = LIST_FROM_THD.set(Mutex::new(Vec::<String>::new()));
    let mut dir_list;

    // Get lock and push dir
    {
        let mut lock = LIST_FROM_THD
            .get_or_init(|| async move { Mutex::new(Vec::<String>::new()) })
            .await
            .lock()
            .await;
        lock.push(dir_string);
    }

    // Like a do-while
    while {
        // Get lock and take array as snapshot
        {
            let mut lock = LIST_FROM_THD
                .get_or_init(|| async move { Mutex::new(Vec::<String>::new()) })
                .await
                .lock()
                .await;
            dir_list = lock.to_vec();
            lock.clear();
        }

        !dir_list.is_empty()
    } {
        let mut dir_list = dir_list.iter();
        while let Some(item) = dir_list.next() {
            let _ = get_dirs(item.to_string()).await?;
        }
    }

    Ok(())
}

async fn get_dirs(dir: String) -> Result<()> {
    let mut entries = fs::read_dir(dir).await?;

    // Folder list
    let mut dirs = Vec::new();

    while let Some(entry) = entries.next_entry().await? {
        let metadata = entry.metadata().await?;
        let path = entry.path();

        if metadata.is_dir() {
            let path = path.display().to_string();
            println!("{}", path);
            dirs.push(path);

            // continue;
        }

        if let Ok(symlink) = fs::read_link(&path).await {
            if path.is_dir() {
                println!("{}@ -> {}", path.display(), symlink.display());
            }
        }
    }

    // Get lock and add array
    {
        let mut lock = LIST_FROM_THD
            .get_or_init(|| async move { Mutex::new(Vec::<String>::new()) })
            .await
            .lock()
            .await;
        lock.append(&mut dirs);
    }

    Ok(())
}

リファレンスとしてmultitimeで50回ループして計測してみます。ただ後述しますが、環境条件によって大きく値が変わる場合があるので参考としてみておいた方がよいです。

参考としてvCPUが4個の環境です。

ちなみにmtimeというRustポーティングがあるのですが動作は少し怪しかったです。

	Mean	Std.Dev.	Min	Median	Max
real	34.545	0.556	34.078	34.476	38.239
user	7.645	0.697	6.195	7.63	9.068
sys	38.631	0.854	37.218	38.61	42.437

ちなみにasync/awaitを使わないバージョンの場合は、

	Mean	Std.Dev.	Min	Median	Max
real	0.691	0.318	0.631	0.643	2.912
user	0.119	0.023	0.076	0.116	0.176
sys	0.547	0.138	0.455	0.532	1.496

async/awaitバージョンは約60倍遅いですね。

関数内の処理を分割してspawnする

1つのブロックをspawn

最初のコードはget_dirs関数をまるごとasync関数にしてみましたが、get_dirsのループ処理は大きく２つのブロックに分かれるので、そのうちの一つをspawnすることで別スレッドでの処理を狙ってみたいと思います。

spawnする処理で共通する配列に書き込むのでArcでテンポラリの配列を共有することにします。

結果としては、

	Mean	Std.Dev.	Min	Median	Max
real	36.184	0.788	34.639	36.195	38.428
user	7.403	0.751	5.788	7.412	9.604
sys	40.845	1.325	36.89	40.766	44.002

さらに若干遅くなりました。

use anyhow::Result; // 1.0.71
use clap::Parser; // 4.3.11

use std::sync::Arc;

use tokio::fs; // 1.32.0
use tokio::sync::{Mutex, OnceCell};

#[derive(Parser, Debug)]
#[command(author, version, about, long_about = None)]
struct Args {
    /// target directory
    #[arg(short, long, default_value_t = String::from(".") )]
    dir: String,
}

static LIST_FROM_THD: tokio::sync::OnceCell<tokio::sync::Mutex<Vec<String>>> = OnceCell::const_new();

#[tokio::main]
async fn main() -> Result<()> {
    let args = Args::parse();

    let dir_string = args.dir.to_string();
    println!("{}", dir_string);

    let _ = LIST_FROM_THD.set(Mutex::new(Vec::<String>::new()));
    let mut dir_list;

    // Get lock and push dir
    {
        let mut lock = LIST_FROM_THD
            .get_or_init(|| async move { Mutex::new(Vec::<String>::new()) })
            .await
            .lock()
            .await;
        lock.push(dir_string);
    }

    // Like a do-while
    while {
        // Get lock and take array as snapshot
        {
            let mut lock = LIST_FROM_THD
                .get_or_init(|| async move { Mutex::new(Vec::<String>::new()) })
                .await
                .lock()
                .await;
            dir_list = lock.to_vec();
            lock.clear();
        }

        !dir_list.is_empty()
    } {
        let mut dir_list = dir_list.iter();
        while let Some(item) = dir_list.next() {
            let _ = get_dirs(item.to_string()).await?;
        }
    }

    Ok(())
}

async fn get_dirs(dir: String) -> Result<()> {
    let mut entries = fs::read_dir(dir).await?;

    // Folder list
    let dirs = Arc::new(Mutex::new(Vec::new()));

    while let Some(entry) = entries.next_entry().await? {
        let metadata = entry.metadata().await?;
        let path = entry.path();

        let _ = tokio::spawn(
            (|path: std::path::PathBuf, dirs: std::sync::Arc<Mutex<Vec<String>>>| async move {
                if metadata.is_dir() {
                    let path = path.display().to_string();
                    println!("{}", path);

                    // Get lock and add array
                    {
                        let mut lock = dirs.lock().await;
                        lock.push(path);
                    }
                    // continue;
                }
            })(path.clone(), Arc::clone(&dirs)),
        )
        .await;

        if let Ok(symlink) = fs::read_link(&path).await {
            if path.is_dir() {
                println!("{}@ -> {}", path.display(), symlink.display());
            }
        }
    }

    // Get lock and add array
    {
        let mut lock = LIST_FROM_THD
            .get_or_init(|| async move { Mutex::new(Vec::<String>::new()) })
            .await
            .lock()
            .await;
        let mut dirs = dirs.lock().await;
        lock.append(&mut dirs);
    }

    Ok(())
}

2つのブロックをspawn

今度は２つのブロックを両方ともspawnしてみます。

結果としては、

	Mean	Std.Dev.	Min	Median	Max
real	86.773	1.278	83.406	86.947	89.449
user	18.794	0.911	17.289	18.658	20.741
sys	119.049	2.265	112.294	119.183	122.94

めちゃくちゃ遅くなりました。約150倍ぐらいでしょうか。

use anyhow::Result; // 1.0.71
use clap::Parser; // 4.3.11

use std::sync::Arc;

use tokio::fs; // 1.32.0
use tokio::sync::{Mutex, OnceCell};

#[derive(Parser, Debug)]
#[command(author, version, about, long_about = None)]
struct Args {
    /// target directory
    #[arg(short, long, default_value_t = String::from(".") )]
    dir: String,
}

static LIST_FROM_THD: tokio::sync::OnceCell<tokio::sync::Mutex<Vec<String>>> = OnceCell::const_new();

#[tokio::main]
async fn main() -> Result<()> {
    let args = Args::parse();

    let dir_string = args.dir.to_string();
    println!("{}", dir_string);

    let _ = LIST_FROM_THD.set(Mutex::new(Vec::<String>::new()));
    let mut dir_list;

    // Get lock and push dir
    {
        let mut lock = LIST_FROM_THD
            .get_or_init(|| async move { Mutex::new(Vec::<String>::new()) })
            .await
            .lock()
            .await;
        lock.push(dir_string);
    }

    // Like a do-while
    while {
        // Get lock and take array as snapshot
        {
            let mut lock = LIST_FROM_THD
                .get_or_init(|| async move { Mutex::new(Vec::<String>::new()) })
                .await
                .lock()
                .await;
            dir_list = lock.to_vec();
            lock.clear();
        }

        !dir_list.is_empty()
    } {
        let mut dir_list = dir_list.iter();
        while let Some(item) = dir_list.next() {
            let _ = get_dirs(item.to_string()).await?;
        }
    }

    Ok(())
}

async fn get_dirs(dir: String) -> Result<()> {
    let mut entries = fs::read_dir(dir).await?;

    // Folder list
    let dirs = Arc::new(Mutex::new(Vec::new()));

    while let Some(entry) = entries.next_entry().await? {
        let metadata = entry.metadata().await?;
        let path = entry.path();

        let _ = tokio::spawn(
            (|path: std::path::PathBuf, dirs: std::sync::Arc<Mutex<Vec<String>>>| async move {
                let path = path.display().to_string();
                if metadata.is_dir() {
                    println!("{}", path);

                    // Get lock and add array
                    {
                        let mut lock = dirs.lock().await;
                        lock.push(path);
                    }
                    //                continue;
                }
            })(path.clone(), Arc::clone(&dirs)),
        )
        .await;

        let _ = tokio::spawn((|| async move {
            if let Ok(symlink) = fs::read_link(&path).await {
                if path.is_dir() {
                    println!("{}@ -> {}", path.display(), symlink.display());
                }
            }
        })())
        .await;
    }

    // Get lock and add array
    {
        let mut lock = LIST_FROM_THD
            .get_or_init(|| async move { Mutex::new(Vec::<String>::new()) })
            .await
            .lock()
            .await;
        let mut dirs = dirs.lock().await;
        lock.append(&mut dirs);
    }

    Ok(())
}

2つのブロックをspawnしてjoin

今度はjoinを使用してみます。joinを使わないと並行処理にならない様なので２つのブロックのspawnをjoinしてみます。

結果としては、

	Mean	Std.Dev.	Min	Median	Max
real	65.748	1.044	65.102	65.511	72.151
user	10.547	0.475	9.643	10.478	11.71
sys	100.7	1.043	99.184	100.507	105.5

若干早くなりましたがやはり依然として遅いです。

use anyhow::Result; // 1.0.71
use clap::Parser; // 4.3.11

use std::sync::Arc;

use tokio::fs; // 1.32.0
use tokio::sync::{Mutex, OnceCell};

#[derive(Parser, Debug)]
#[command(author, version, about, long_about = None)]
struct Args {
    /// target directory
    #[arg(short, long, default_value_t = String::from(".") )]
    dir: String,
}

static LIST_FROM_THD: tokio::sync::OnceCell<tokio::sync::Mutex<Vec<String>>> = OnceCell::const_new();

#[tokio::main]
async fn main() -> Result<()> {
    let args = Args::parse();

    let dir_string = args.dir.to_string();
    println!("{}", dir_string);

    let _ = LIST_FROM_THD.set(Mutex::new(Vec::<String>::new()));
    let mut dir_list;

    // Get lock and push dir
    {
        let mut lock = LIST_FROM_THD
            .get_or_init(|| async move { Mutex::new(Vec::<String>::new()) })
            .await
            .lock()
            .await;
        lock.push(dir_string);
    }

    // Like a do-while
    while {
        // Get lock and take array as snapshot
        {
            let mut lock = LIST_FROM_THD
                .get_or_init(|| async move { Mutex::new(Vec::<String>::new()) })
                .await
                .lock()
                .await;
            dir_list = lock.to_vec();
            lock.clear();
        }

        !dir_list.is_empty()
    } {
        let mut dir_list = dir_list.iter();
        while let Some(item) = dir_list.next() {
            let _ = get_dirs(item.to_string()).await?;
        }
    }

    Ok(())
}

async fn get_dirs(dir: String) -> Result<()> {
    let mut entries = fs::read_dir(dir).await?;

    // Folder list
    let dirs = Arc::new(Mutex::new(Vec::new()));

    while let Some(entry) = entries.next_entry().await? {
        let metadata = entry.metadata().await?;
        let path = entry.path();

        let thd_1 = tokio::spawn(
            (|path: std::path::PathBuf, dirs: std::sync::Arc<Mutex<Vec<String>>>| async move {
                let path = path.display().to_string();
                if metadata.is_dir() {
                    println!("{}", path);

                    // Get lock and add array
                    {
                        let mut lock = dirs.lock().await;
                        lock.push(path);
                    }
                    //                continue;
                }
            })(path.clone(), dirs.clone()),
        );

        let thd_2 = tokio::spawn(async move {
            if let Ok(symlink) = fs::read_link(&path).await {
                if path.is_dir() {
                    println!("{}@ -> {}", path.display(), symlink.display());
                }
            }
        });

        tokio::join!(thd_1, thd_2);
    }

    // Get lock and add array
    {
        let mut lock = LIST_FROM_THD
            .get_or_init(|| async move { Mutex::new(Vec::<String>::new()) })
            .await
            .lock()
            .await;
        let mut dirs = dirs.lock().await;
        lock.append(&mut dirs);
    }

    Ok(())
}

Futureをコレクションに溜め込んでawaitしてみる

多数のFutureがある場合はどうすればよいでしょうか

Vec等のコレクションにFutureを溜め込み、後にまとめて処理すると並行処理になるということをこちらで書かれていたのでコードを参考にネット上で集めた幾つかのパターンを検証してみます。

ループの中で新規Futureを作ってawaitする、を繰り返す事とFutureをまとめてループでawaitする事は同じ気がするのですが実行結果より違うことがわかります。これの違いが何かよくわかっていませんが溜め込む事が並行処理になるのでこういう書き方のパターンとして覚えておくことにします。

tokio::task::JoinSetとFuturesUnorderedは処理が終了したものから結果を取り出すようです。

join_allとVecとFuturesOrderedは同等の出力結果が得られました。

// [dependencies]
// tokio = { version = "1.32.0", features = ["full"] }
// futures = "0.3"

use futures::future;
// use std::time::Duration;
use tokio::time::{Duration};
use tokio::task;

// use futures::{stream::FuturesUnordered, stream::FuturesOrdered, StreamExt};
use futures::stream::{FuturesUnordered, FuturesOrdered, StreamExt};

// 5秒待って引数をそのまま返す非同期関数
async fn some_heavy_work(id: i64) -> i64 {
    tokio::time::sleep(Duration::from_secs(5)).await;
    id
}

#[tokio::main]
async fn main() {

// 参照
// https://zenn.dev/nojima/articles/30bef27473a6fd

    println!("Case: join_all");
    let now = tokio::time::Instant::now();
    // 1000個の Future を作る (このタイミングでは実行されていない)
    // let works: Vec<_> = (0..1000).map(|id| some_heavy_work(id)).collect();
    let works: Vec<_> = (0..1000).map(|id| task::spawn(some_heavy_work(id))).collect();
    // 1000個の Future を並列実行する
    let ret = future::join_all(works).await;
    let ret: Vec<_> = ret.iter().map(|res| res.as_ref().unwrap()).collect();
    println!("ret = {:?}", ret);

    let duration = now.elapsed();
    println!("duration = {:?}", duration);

///////////////////////////////////////////////////////////////////

    println!("Case: Vec");
    let mut ret = Vec::new();
    let mut thds = Vec::new();
    let now = tokio::time::Instant::now();

    for id in 0..1000 {
        thds.push(task::spawn(some_heavy_work(id)));
    }

    for thd in thds {
        if let Ok(id) = thd.await {
            ret.push(id);
        }
    }
    println!("ret = {:?}", ret);

    let duration = now.elapsed();
    println!("duration = {:?}", duration);

///////////////////////////////////////////////////////////////////

    println!("Case: FuturesUnordered");
    let mut ret = Vec::new();
    let mut thds = FuturesUnordered::new();
    let now = tokio::time::Instant::now();

    for id in 0..1000 {
        thds.push(task::spawn(some_heavy_work(id)));
    }

    // let ret: Vec<_>  = thds.map(|res| res.unwrap()).collect().await;
    while let Some(Ok(id)) = thds.next().await {
        ret.push(id);
    }
    println!("ret = {:?}", ret);

    let duration = now.elapsed();
    println!("duration = {:?}", duration);

///////////////////////////////////////////////////////////////////

    println!("Case: tokio::task::JoinSet");
    let mut ret = Vec::new();
    let mut thds = task::JoinSet::new();
    let now = tokio::time::Instant::now();

    for id in 0..1000 {
        thds.spawn(some_heavy_work(id));
    }

    while let Some(Ok(id)) = thds.join_next().await {
        ret.push(id);
    }
    println!("ret = {:?}", ret);

    let duration = now.elapsed();
    println!("duration = {:?}", duration);

///////////////////////////////////////////////////////////////////

    println!("Case: FuturesOrdered");
    let mut ret = Vec::new();
    let mut thds = FuturesOrdered::new();
    let now = tokio::time::Instant::now();

    for id in 0..1000 {
        thds.push_back(task::spawn(some_heavy_work(id)));
    }

    // let ret: Vec<_>  = thds.map(|res| res.unwrap()).collect().await;
    while let Some(Ok(id)) = thds.next().await {
        ret.push(id);
    }
    println!("ret = {:?}", ret);

    let duration = now.elapsed();
    println!("duration = {:?}", duration);

///////////////////////////////////////////////////////////////////

    println!("Case: spawn and await in loop");
    let mut ret = Vec::new();
    let now = tokio::time::Instant::now();

    // 時間がかかりすぎるので4つまで
    for i in 0..4 {
        let thd = task::spawn(some_heavy_work(i));
        if let Ok(id) = thd.await {
            ret.push(id);
        }
    }
    println!("ret = {:?}", ret);

    let duration = now.elapsed();
    println!("duration = {:?}", duration);

}

Case: join_all
ret = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, ...]
duration = 5.003171598s
Case: Vec
ret = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, ...]
duration = 5.003351963s
Case: FuturesUnordered
ret = [804, 784, 746, 785, 747, 748, 749, 810, 750, 811, 751, 812, 752, 813, 753, 814, 754, 815, 755, 816, 756, 817, 818, 830, 819, 831, 820, 832, 821, 833, 822, 834, 823, 835, 824, 836, 825, 837, 826, 838, 827, 839, 828, 840, 829, 841, 842, 846, 843, 847, 844, 848, 845, 849, 850, 861, 851, 862, 852, 863, 853, 864, 854, 865, 873, 866, 867, 874, 868, 882, 883, 855, 875, 856, 876, 857, 869, 877, 870, ...]
duration = 5.003634026s
Case: tokio::task::JoinSet
ret = [855, 777, 778, 807, 741, 779, 808, 780, 809, 781, 782, 783, 810, 784, 811, 785, 812, 786, 813, 787, 814, 788, 721, 815, 722, 816, 856, 817, 857, 723, 858, 859, 893, 860, 894, 861, 895, 862, 896, 863, 897, 898, 0, 864, 899, 865, 900, 866, 1, 901, 867, 902, 903, 868, 904, 869, 905, 870, 906, 871, 907, 872, 818, 873, 819, 874, 920, 875, 820, 821, 822, 876, 823, 877, 824, 878, 879, 880, 881, 882, ...]
duration = 5.004296013s
Case: FuturesOrdered
ret = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, ...]
duration = 5.003727411s
Case: spawn and await
ret = [0, 1, 2, 3]
duration = 20.006742359s

JoinSetでまとめて処理する

並行処理性を高める書き方が分かったのでspawnできそうな処理をJoinSetにガンガン追加してみました。

その結果、劇的な速度の改善がありました。対象データが増えてもvCPUを増やすことでスケールアウトしていくことができそうです。

と、思ったのですが次章では結果が想定と異なりました。

	Mean	Std.Dev.	Min	Median	Max
real	3.328	0.222	3.139	3.263	4.465
user	2.566	0.189	2.284	2.539	3.378
sys	8.902	0.496	8.34	8.792	11.126

use anyhow::Result; // 1.0.71
use clap::Parser; // 4.3.11

use std::sync::Arc;

use tokio::fs; // 1.32.0
use tokio::sync::{Mutex, OnceCell};
use tokio::task::JoinSet;

#[derive(Parser, Debug)]
#[command(author, version, about, long_about = None)]
struct Args {
    /// target directory
    #[arg(short, long, default_value_t = String::from(".") )]
    dir: String,
}

static LIST_FROM_THD: tokio::sync::OnceCell<tokio::sync::Mutex<Vec<String>>> =
    OnceCell::const_new();

#[tokio::main]
async fn main() -> Result<()> {
    let args = Args::parse();

    let dir_string = args.dir.to_string();
    println!("{}", dir_string);

    let _ = LIST_FROM_THD.set(Mutex::new(Vec::<String>::new()));
    let mut dir_list;

    // Get lock and push dir
    {
        let mut lock = LIST_FROM_THD
            .get_or_init(|| async move { Mutex::new(Vec::<String>::new()) })
            .await
            .lock()
            .await;
        lock.push(dir_string);
    }

    // Like a do-while
    while {
        // Get lock and take array as snapshot
        {
            let mut lock = LIST_FROM_THD
                .get_or_init(|| async move { Mutex::new(Vec::<String>::new()) })
                .await
                .lock()
                .await;
            dir_list = lock.to_vec();
            lock.clear();
        }

        !dir_list.is_empty()
    } {
        let mut thds = JoinSet::new();
        let mut dir_list = dir_list.iter();
        while let Some(item) = dir_list.next() {
            thds.spawn(get_dirs(item.to_string()));
        }

        while let Some(thd) = thds.join_next().await {
            let _ = thd;
        }
    }

    Ok(())
}

async fn get_dirs(dir: String) -> Result<()> {
    let mut entries = fs::read_dir(dir).await?;

    // Folder list
    let dirs = Arc::new(Mutex::new(Vec::new()));

    let mut thds = JoinSet::new();

    while let Some(entry) = entries.next_entry().await? {
        let metadata = entry.metadata().await?;
        let path = entry.path();

        thds.spawn(
            (|path: std::path::PathBuf, dirs: std::sync::Arc<Mutex<Vec<String>>>| async move {
                let path = path.display().to_string();
                if metadata.is_dir() {
                    println!("{}", path);

                    // Get lock and add array
                    {
                        let mut lock = dirs.lock().await;
                        lock.push(path);
                    }
                    //                continue;
                }
            })(path.clone(), dirs.clone()),
        );

        thds.spawn(async move {
            if let Ok(symlink) = fs::read_link(&path).await {
                if path.is_dir() {
                    println!("{}@ -> {}", path.display(), symlink.display());
                }
            }
        });

    }

    while let Some(thd) = thds.join_next().await {
        let _ = thd;
    }

    // Get lock and add array
    {
        let mut lock = LIST_FROM_THD
            .get_or_init(|| async move { Mutex::new(Vec::<String>::new()) })
            .await
            .lock()
            .await;
        let mut dirs = dirs.lock().await;
        lock.append(&mut dirs);
    }

    Ok(())
}

VirtualBoxの動作

物理コア=8（vCPU Max 16）の環境だったのでvCPU=8を使うようにVirtualBoxの設定を変更して検証したのですが劇的に遅くなりました。

考えられる可能性としては主に4つ。

vCPU=8だといずれかのCPUでOSなりアプリケーションが動いているので遅いものがあるはずです。VirtualBoxがソレを掴んでしまうのでそこで動くスレッドも遅くなり、結果ボトルネックとなっている可能性。
vCPUが増えたことにより、スケジューラーが各vCPUの動作状況を確認するコスト等が増えて結果として遅くなった可能性。
Linuxカーネルやライブラリ等がかなり古くマルチコアの性能を活かしきれていない環境起因の可能性。
mutexのロック解除待ちが発生している可能性。

極端に遅い場合はVirtualBoxを立ち上げ直してパフォーマンスが変わるかどうか確認したり、vCPUを減らすことも経験上有効なようです。

vCPU=6

	Mean	Std.Dev.	Min	Median	Max
real	4.5	0.259	4.28	4.447	6.13
user	3.798	0.157	3.505	3.784	4.247
sys	17.174	0.808	16.366	17.059	22.184

vCPU=8

	Mean	Std.Dev.	Min	Median	Max
real	18.049	0.99	15.94	17.933	21.145
user	5.738	0.246	5.118	5.739	6.59
sys	85.446	4.443	73.922	85.468	95.867

まとめ

tokioランタイムと組み合わせてasync/awaitを使えば非同期コードを書けることがわかりました。ただし、並列処理をするにはspawnして明示的にスレッドに渡すようにしないといけません。ただspawnするだけではパフォーマンスが出ないので書き方に注意が必要です。JavaScriptとは違い注意が必要です。

またFutureをまとめてspawnすることによってtokioランタイムのスケジューラーが上手い事、並行処理してくれる事がわかりました。ただ、理屈がよく理解できていません。

JoinSetはPromise.all的な使い方ができそうで、グローバル変数をスレッドで共有していますがこれも不要にする書き方ができそうです。

VirtualBoxの動作はちょっと分からない事が多いです。要因が多いのでできるところから潰してボトルネックを特定していくしかなさそうです。

次回はボトルネックの可能性があるMutexを調べてみます。

同じようなコードばかりでスミマセン。

[rust] tokioで様々なasync/awaitの使い方を試してみる

tokioの基本動作

再帰処理を使用しないパターン

関数内の処理を分割してspawnする

1つのブロックをspawn

2つのブロックをspawn

2つのブロックをspawnしてjoin

Futureをコレクションに溜め込んでawaitしてみる

JoinSetでまとめて処理する

VirtualBoxの動作

まとめ

Related Posts