From c455caec07a0e700c39d0abcc21f5406c3747899 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Jingcheng=20Wu=EF=BC=88=E5=90=B4=20=E7=92=9F=E6=89=BF?= =?UTF-8?q?=EF=BC=89?= <42027179+Wu-Jingcheng@users.noreply.github.com> Date: Fri, 17 Jun 2022 00:37:45 -0700 Subject: [PATCH] Update pagerank_on_sona_en.md Fixed a `Chinese Leakage` issue in the document. --- docs/algo/sona/pagerank_on_sona_en.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/docs/algo/sona/pagerank_on_sona_en.md b/docs/algo/sona/pagerank_on_sona_en.md index ec1bafcfc..b46d9112a 100644 --- a/docs/algo/sona/pagerank_on_sona_en.md +++ b/docs/algo/sona/pagerank_on_sona_en.md @@ -26,8 +26,8 @@ We implemented large-scale PageRank calculation based on Spark On Angel, where p ### Resource parameters - - Angel PS number and memory: The product of ps.instance and ps.memory is the total configuration memory of ps. In order to ensure that Angel does not hang, you need to configure memory about twice the size of the model. For PageRank, the calculation formula of the model size is: number of nodes * 3 * 4 Byte, according to which you can estimate the size of ps memory that needs to be configured under Graph input of different sizes - - Spark的资源配置:The product of num-executors and executor-memory is the total configuration memory of executors, and it is best to store twice the input data. If the memory is tight, 1 times is also acceptable, but relatively slower. For example, a 10 billion edge set is about 160G in size, and a 20G * 20 configuration is sufficient. In a situation where resources are really tight, try to increase the number of partitions! + - Angel configurations of PS number and memory: The product of ps.instance and ps.memory is the total configuration memory of ps. In order to ensure that Angel does not hang, you need to configure memory about twice the size of the model. For PageRank, the calculation formula of the model size is: number of nodes * 3 * 4 Byte, according to which you can estimate the size of ps memory that needs to be configured under Graph input of different sizes + - Spark configurations:The product of num-executors and executor-memory is the total configuration memory of executors, and it is best to store twice the input data. If the memory is tight, 1 times is also acceptable, but relatively slower. For example, a 10 billion edge set is about 160G in size, and a 20G * 20 configuration is sufficient. In a situation where resources are really tight, try to increase the number of partitions! ### Submitting scripts ```