2
2
3
3
The onet library allows for multiple levels of simulations:
4
4
5
- - localhost:
6
-
5
+ - [ Localhost] ( ./platform/LOCALHOST.md ) :
7
6
- up to 100 nodes
8
-
9
- - mininet:
10
-
7
+ - [ Mininet] ( ./platform/MININET.md ) :
11
8
- up to 300 nodes on a 48-core machine, multiplied by the number of machines
12
9
available
13
-
14
10
- define max. bandwidth and delay for your network
15
-
16
- - deterlab:
17
-
11
+ - [ Deterlab] ( ./platform/DETERLAB.md ) :
18
12
- up to 1000 nodes on a strong machine, multiplied by the number of machines
19
13
available
20
14
21
- Refer to the simulation-examples in simul/manage/simulation and
22
- < https://github.com/dedis/cothority_template >
15
+ Refer to the simulation-examples in one of the following places:
16
+ - [ ./manage/simulation] ( ./manage/simulation )
17
+ - [ ./test_simul] ( ./test_simul )
18
+ - https://github.com/dedis/cothority_template
23
19
24
20
## Runfile for simulations
25
21
@@ -35,16 +31,62 @@ experiment, where each experiment makes up one line.
35
31
### Necessary variables
36
32
37
33
- ` Simulation ` - what simulation to run
38
- - ` Hosts ` - how many hosts to instantiate
39
- - ` Servers ` - how many servers to use
34
+ - ` Hosts ` - how many hosts to instantiate - this corresponds to the nodes
35
+ that will be running and available in the main ` Roster `
36
+ - ` Servers ` - how many servers to use maximum - if less than this number of
37
+ servers are available, a warning will be printed, but the simulation will
38
+ still be run
39
+
40
+ The ` Servers ` will mostly influence how the simulation will be run.
41
+ Depending on the platform, this will be handled differently:
42
+ - ` localhost ` - ` Servers ` is ignored here
43
+ - ` Deterlab ` - the system will distribute the ` Hosts ` nodes over the
44
+ available servers, but not over more than ` Servers ` .
45
+ This allows for running simulations that are smaller than your whole DETERLab experiment without having to modify and restart the
46
+ experiment.
47
+ - ` Mininet ` - as in ` Deterlab ` , the ` Hosts ` nodes will be distributed over
48
+ a maximum of ` Servers ` .
40
49
41
50
### onet.SimulationBFTree
42
51
43
- If you use the ` onet.SimulationBFTree ` , the following variables are also available:
52
+ The standard simulation (and the only one implemented) is the
53
+ ` SimulationBFTree ` , which will prepare the ` Roster ` and the ` Tree ` for the
54
+ simulation.
55
+ Even if you use the ` SimulationBFTree ` , you're not restricted to use only the
56
+ prepared ` Tree ` .
57
+ However, there will not be more nodes available than the ones in the prepared
58
+ ` Roster ` .
59
+ Some restrictions apply when you're using the ` Deterlab ` simulation:
60
+ - all nodes on one server (` Hosts ` / min(available servers, ` Servers ` )) are
61
+ run in one binary, which means
62
+ - bandwidth measurements cover all the nodes
63
+ - time measurements need to make sure no other calculations are taking place
64
+ - the bandwidth- and delay-restrictions only apply between two physical servers, so
65
+ - the simulation makes sure that all connected nodes in the ` Tree ` are always
66
+ on different servers. If you use another communication than the one in the
67
+ ` Tree ` , this will mean that the system cannot guarantee that the
68
+ communication is restricted
69
+ - the bandwidth restrictions apply to the sum of all communications between
70
+ two servers, so to a number of hosts
71
+ If you want to have a bandwidth restriction that is between all nodes, and
72
+ ` Hosts > Servers ` , you have to use the ` Mininet ` platform, which doesn't
73
+ have this restriction.
74
+
75
+ The following variables define how the original ` Tree ` is calculated - only
76
+ one of the two should be given:
44
77
45
78
- ` BF ` - branching factor: how many children each node has
46
79
- ` Depth ` - the depth of the tree in levels below the root-node
47
- - ` Rounds ` - for how many rounds the simulation should run
80
+
81
+ If there are 13 ` Hosts ` with a ` BF ` of 3, the system will create a complete
82
+ tree with the root-node having 3 children, and each of the children having 3
83
+ more children.
84
+ The same setup can be achieved with 13 ` Hosts ` and a ` Depth ` of 3.
85
+
86
+ If the tree to be created is not complete, it will be filled breath-first and
87
+ the children of the last row will be distributed as evenly as possible.
88
+
89
+ In addition, ` Rounds ` defines how many rounds the simulation will run.
48
90
49
91
### Statistics for subset of hosts
50
92
@@ -164,3 +206,67 @@ Alternatively, it can be set for each individual experiment:
164
206
7,100
165
207
15,200
166
208
31,400
209
+
210
+ ## test_data format
211
+
212
+ Every simulation will be written to the ` test_data ` directory with the name
213
+ of the simulation file as base and a ` .csv ` applied.
214
+ The configuration of the simulation file is written to the tables in the
215
+ following columns, which are copied as-is from the simulation file:
216
+
217
+ - hosts, bf, delay, depth, other, prescript, ratio, rounds, servers, suite
218
+
219
+ For all the other measurements, the following statistics are available:
220
+
221
+ - ` _avg ` - the average
222
+ - ` _std ` - standard-deviation
223
+ - ` _min ` - minimum
224
+ - ` _max ` - maximum
225
+ - ` _sum ` - sum of all calls
226
+
227
+ ### measure.NewTimeMeasure
228
+
229
+ The following measurements will be taken for ` measure.NewTimeMeasure ` :
230
+ - ` _user ` - user-space time, crypto and other calculations
231
+ - ` _system ` - system-space time - disk i/o network i/o
232
+ - ` _wall ` - wall-clock, as described above
233
+
234
+ The measurements are given in seconds.
235
+ There is an important difference in the ` _wall ` and the ` _user ` /` _system `
236
+ measurements: the ` _wall ` measurements indicate how much time an external
237
+ observer would have measured.
238
+ So if the system waits for a reply of the network, this waiting time is
239
+ included in the measurement.
240
+ Contrary to this, the ` _user ` /` _system ` measures how much work has been done
241
+ by the CPU during the measurement.
242
+ When measuring parallel execution of code, it is possible that the
243
+ ` _user ` /` _system ` measurements are bigger than the ` _wall ` measurements
244
+ , because more than one CPU participated in the calculation.
245
+ The difference in ` _user ` /` _system ` is explained for example here:
246
+ https://stackoverflow.com/questions/556405/what-do-real-user-and-sys-mean-in-the-output-of-time1
247
+ The ` _wall ` corresponds to the ` real ` in this comment.
248
+
249
+ There are some standard time measurements done by the simulation:
250
+ - ` ChildrenWait ` - how long the system had to wait for all children to be
251
+ available - might show problems in setting up the servers
252
+ - ` SimulSyncWait ` - how long the system had to wait at the end of the
253
+ simulation - might indicate problems in the wrap-up of the simulation
254
+
255
+ ### measure.NewCounterIOMeasure
256
+
257
+ If you want to measure bandwidth, you can use ` measure.NewCounterIOMeasure ` .
258
+ But you have to be careful to make sure that the system will not include
259
+ traffic that is outside of your scope by putting the ` .Record() ` as close as
260
+ possible to the ` NewCounterIOMeasure ` .
261
+ Every ` CounterIOMeasure ` has the following statistics:
262
+
263
+ - ` _tx ` - transmission-bytes
264
+ - ` _rx ` - bytes received
265
+ - ` _msg_tx ` - packets transmitted
266
+ - ` _msg_rx ` - packets received
267
+
268
+ Plus the standard modifiers (` _avg ` , ` _std ` , ...).
269
+
270
+ There are two standard measurements done by every simulation:
271
+ - ` bandwidth ` (empty) - all node bandwidth
272
+ - ` bandwidth_root ` - bandwidth of the first node of the roster
0 commit comments