Skip to content

Commit f15728b

Browse files
author
Jeff R. Allen
authored
Merge pull request #649 from dedis/simul_docu
Updating documentation for simulations and .ns-file
2 parents 64b58cc + 539f333 commit f15728b

File tree

3 files changed

+133
-20
lines changed

3 files changed

+133
-20
lines changed

simul/README.md

+121-15
Original file line numberDiff line numberDiff line change
@@ -2,24 +2,20 @@
22

33
The onet library allows for multiple levels of simulations:
44

5-
- localhost:
6-
5+
- [Localhost](./platform/LOCALHOST.md):
76
- up to 100 nodes
8-
9-
- mininet:
10-
7+
- [Mininet](./platform/MININET.md):
118
- up to 300 nodes on a 48-core machine, multiplied by the number of machines
129
available
13-
1410
- define max. bandwidth and delay for your network
15-
16-
- deterlab:
17-
11+
- [Deterlab](./platform/DETERLAB.md):
1812
- up to 1000 nodes on a strong machine, multiplied by the number of machines
1913
available
2014

21-
Refer to the simulation-examples in simul/manage/simulation and
22-
<https://github.com/dedis/cothority_template>
15+
Refer to the simulation-examples in one of the following places:
16+
- [./manage/simulation](./manage/simulation)
17+
- [./test_simul](./test_simul)
18+
- https://github.com/dedis/cothority_template
2319

2420
## Runfile for simulations
2521

@@ -35,16 +31,62 @@ experiment, where each experiment makes up one line.
3531
### Necessary variables
3632

3733
- `Simulation` - what simulation to run
38-
- `Hosts` - how many hosts to instantiate
39-
- `Servers` - how many servers to use
34+
- `Hosts` - how many hosts to instantiate - this corresponds to the nodes
35+
that will be running and available in the main `Roster`
36+
- `Servers` - how many servers to use maximum - if less than this number of
37+
servers are available, a warning will be printed, but the simulation will
38+
still be run
39+
40+
The `Servers` will mostly influence how the simulation will be run.
41+
Depending on the platform, this will be handled differently:
42+
- `localhost` - `Servers` is ignored here
43+
- `Deterlab` - the system will distribute the `Hosts` nodes over the
44+
available servers, but not over more than `Servers`.
45+
This allows for running simulations that are smaller than your whole DETERLab experiment without having to modify and restart the
46+
experiment.
47+
- `Mininet` - as in `Deterlab`, the `Hosts` nodes will be distributed over
48+
a maximum of `Servers`.
4049

4150
### onet.SimulationBFTree
4251

43-
If you use the `onet.SimulationBFTree`, the following variables are also available:
52+
The standard simulation (and the only one implemented) is the
53+
`SimulationBFTree`, which will prepare the `Roster` and the `Tree` for the
54+
simulation.
55+
Even if you use the `SimulationBFTree`, you're not restricted to use only the
56+
prepared `Tree`.
57+
However, there will not be more nodes available than the ones in the prepared
58+
`Roster`.
59+
Some restrictions apply when you're using the `Deterlab` simulation:
60+
- all nodes on one server (`Hosts` / min(available servers, `Servers`)) are
61+
run in one binary, which means
62+
- bandwidth measurements cover all the nodes
63+
- time measurements need to make sure no other calculations are taking place
64+
- the bandwidth- and delay-restrictions only apply between two physical servers, so
65+
- the simulation makes sure that all connected nodes in the `Tree` are always
66+
on different servers. If you use another communication than the one in the
67+
`Tree`, this will mean that the system cannot guarantee that the
68+
communication is restricted
69+
- the bandwidth restrictions apply to the sum of all communications between
70+
two servers, so to a number of hosts
71+
If you want to have a bandwidth restriction that is between all nodes, and
72+
`Hosts > Servers`, you have to use the `Mininet` platform, which doesn't
73+
have this restriction.
74+
75+
The following variables define how the original `Tree` is calculated - only
76+
one of the two should be given:
4477

4578
- `BF` - branching factor: how many children each node has
4679
- `Depth` - the depth of the tree in levels below the root-node
47-
- `Rounds` - for how many rounds the simulation should run
80+
81+
If there are 13 `Hosts` with a `BF` of 3, the system will create a complete
82+
tree with the root-node having 3 children, and each of the children having 3
83+
more children.
84+
The same setup can be achieved with 13 `Hosts` and a `Depth` of 3.
85+
86+
If the tree to be created is not complete, it will be filled breath-first and
87+
the children of the last row will be distributed as evenly as possible.
88+
89+
In addition, `Rounds` defines how many rounds the simulation will run.
4890

4991
### Statistics for subset of hosts
5092

@@ -164,3 +206,67 @@ Alternatively, it can be set for each individual experiment:
164206
7,100
165207
15,200
166208
31,400
209+
210+
## test_data format
211+
212+
Every simulation will be written to the `test_data` directory with the name
213+
of the simulation file as base and a `.csv` applied.
214+
The configuration of the simulation file is written to the tables in the
215+
following columns, which are copied as-is from the simulation file:
216+
217+
- hosts, bf, delay, depth, other, prescript, ratio, rounds, servers, suite
218+
219+
For all the other measurements, the following statistics are available:
220+
221+
- `_avg` - the average
222+
- `_std` - standard-deviation
223+
- `_min` - minimum
224+
- `_max` - maximum
225+
- `_sum` - sum of all calls
226+
227+
### measure.NewTimeMeasure
228+
229+
The following measurements will be taken for `measure.NewTimeMeasure`:
230+
- `_user` - user-space time, crypto and other calculations
231+
- `_system` - system-space time - disk i/o network i/o
232+
- `_wall` - wall-clock, as described above
233+
234+
The measurements are given in seconds.
235+
There is an important difference in the `_wall` and the `_user`/`_system`
236+
measurements: the `_wall` measurements indicate how much time an external
237+
observer would have measured.
238+
So if the system waits for a reply of the network, this waiting time is
239+
included in the measurement.
240+
Contrary to this, the `_user`/`_system` measures how much work has been done
241+
by the CPU during the measurement.
242+
When measuring parallel execution of code, it is possible that the
243+
`_user`/`_system` measurements are bigger than the `_wall` measurements
244+
, because more than one CPU participated in the calculation.
245+
The difference in `_user`/`_system` is explained for example here:
246+
https://stackoverflow.com/questions/556405/what-do-real-user-and-sys-mean-in-the-output-of-time1
247+
The `_wall` corresponds to the `real` in this comment.
248+
249+
There are some standard time measurements done by the simulation:
250+
- `ChildrenWait` - how long the system had to wait for all children to be
251+
available - might show problems in setting up the servers
252+
- `SimulSyncWait` - how long the system had to wait at the end of the
253+
simulation - might indicate problems in the wrap-up of the simulation
254+
255+
### measure.NewCounterIOMeasure
256+
257+
If you want to measure bandwidth, you can use `measure.NewCounterIOMeasure`.
258+
But you have to be careful to make sure that the system will not include
259+
traffic that is outside of your scope by putting the `.Record()` as close as
260+
possible to the `NewCounterIOMeasure`.
261+
Every `CounterIOMeasure` has the following statistics:
262+
263+
- `_tx` - transmission-bytes
264+
- `_rx` - bytes received
265+
- `_msg_tx` - packets transmitted
266+
- `_msg_rx` - packets received
267+
268+
Plus the standard modifiers (`_avg`, `_std`, ...).
269+
270+
There are two standard measurements done by every simulation:
271+
- `bandwidth` (empty) - all node bandwidth
272+
- `bandwidth_root` - bandwidth of the first node of the roster

simul/platform/DETERLAB.md

+3-1
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,9 @@ Before a successful Deterlab simulation, you need to
1818
1. be signed up at Deterlab. If you're working with DEDIS, ask your
1919
responsible for the _Project Name_ and the _Group Name_.
2020
2. create a simulation defined by an NS-file. You can find a simple
21-
NS-file here: [cothority.ns](./deterlab_users/cothority.ns)
21+
NS-file here: [cothority.ns](./deterlab_users/cothority.ns) - you'll need to
22+
adjust the # of servers, the type of servers, and the bandwidth- and delay
23+
restrictions.
2224
3. swap the simulation in
2325

2426
For point 3. it is important of course that Deterlab has enough machines

simul/platform/deterlab_users/cothority.ns

+9-4
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,9 @@
11
set ns [new Simulator]
22
source tb_compat.tcl
33

4+
# Set you number of servers here, as well s the delay.
5+
# The bandwidth can be set below.
6+
47
set server_count 30
58
set server_delay 100ms
69
set lanstr ""
@@ -9,14 +12,16 @@ set lanstr ""
912

1013
for {set i 0} {$i < $server_count} {incr i} {
1114
set server($i) [$ns node]
12-
#tb-set-hardware $server($i) dl380g3
13-
#tb-set-hardware $server($i) MicroCloud
14-
tb-set-hardware $server($i) pc2133
15+
# Uncomment the server hardware you're using here, or add your own
16+
#tb-set-hardware $server($i) dl380g3
17+
#tb-set-hardware $server($i) MicroCloud
18+
tb-set-hardware $server($i) pc2133
1519
tb-set-node-os $server($i) Ubuntu1404-64-STD
1620
append server_lanstr "$server($i) "
1721
}
1822

19-
23+
# Here you can set your bandwidth restrictions by replacing the 144Mb with the
24+
# expected bandwidth
2025
tb-use-endnodeshaping 1
2126
set serverlan [$ns make-lan "$server_lanstr" 144Mb $server_delay]
2227

0 commit comments

Comments
 (0)