@@ -29,7 +29,7 @@ ivory_repository/
29
29
│ └── stores
30
30
│ ├── feature_store1
31
31
│ └── feature_store2
32
- └── fact_sets
32
+ └── factsets
33
33
├── fact_set1
34
34
└── fact_set2
35
35
```
@@ -74,8 +74,8 @@ my_fact_set/
74
74
75
75
```
76
76
77
- In this fact set, facts are partioned across two namespaces: ` widgets ` and ` demo ` . The * widget* facts
78
- are spread accross three dates, while * demographic* facts are constrained to one. Note also that
77
+ In this fact set, facts are partitioned across two namespaces: ` widgets ` and ` demo ` . The * widget* facts
78
+ are spread across three dates, while * demographic* facts are constrained to one. Note also that
79
79
a given namespace-partition can contain multiple EAVT files.
80
80
81
81
EAVT files are simply pipe-delimited text files with one EAVT record per line. For example, a line in
@@ -112,10 +112,10 @@ The ordering is important as it allows facts to be overriden. When a feature sto
112
112
with the same entity, attribute and time are identified, the value from the fact contained in the most recent fact
113
113
set will be used, where most recent means listed higher in the feature store file.
114
114
115
- Because a feature store can be speified by just referencing fact sets, Ivory can support poor-man versioning giving
115
+ Because a feature store can be specified by just referencing fact sets, Ivory can support poor-man versioning giving
116
116
rise to use cases such as:
117
117
118
- * overrding buggy values with corrected ones;
118
+ * overriding buggy values with corrected ones;
119
119
* combining * production* features with * ad-hoc* features.
120
120
121
121
@@ -138,7 +138,7 @@ feature identifier the following metadata:
138
138
139
139
* A human-readable * description* .
140
140
141
- In Ivory, feature metadata is seperated from the features store (facts) in its own set of text files known
141
+ In Ivory, feature metadata is separated from the features store (facts) in its own set of text files known
142
142
as * feature dictionaries* . Dictionary text files are also pipe-delimited and of the following form:
143
143
144
144
```
@@ -165,7 +165,7 @@ check that the encoding types specified for features in the dictionary are consi
165
165
> ivory validate --feature-store feature_store.txt --dictionary feature_dictionary.txt
166
166
```
167
167
168
- We can also use Ivory to generate statistics for the values of specific features accross a feature store using the
168
+ We can also use Ivory to generate statistics for the values of specific features across a feature store using the
169
169
` inspect ` command. This will compute statistics such as density, ranges (for numerical features), factors (for
170
170
categorical features), historgrams, means, etc. Inspections can filter both the features of interest as well which
171
171
facts to considered by time:
@@ -181,7 +181,7 @@ Querying
181
181
Ivory supports two types of queries: * snapshots* and * chords* .
182
182
183
183
184
- A ` snaphot ` query is used to extract the feature values for entities at a certain point in time. Snapshoting can filter
184
+ A ` snapshot ` query is used to extract the feature values for entities at a certain point in time. Snapshotting can filter
185
185
the set of features and/or entities considered. By default the output is in * EAVT* format, but can be output in
186
186
row-oriented form (i.e. column per feature) using the ` --pivot ` option. When a ` snapshot ` query is performed, the most
187
187
recent feature value for a given entity-attribute, with respect to the snapshot time, will be returned in the output:
@@ -233,7 +233,7 @@ This outputs two files:
233
233
The format of the feature flag file is:
234
234
235
235
```
236
- namespace|name|sparcity|fequency
236
+ namespace|name|sparcity|frequency
237
237
```
238
238
239
239
An example is:
@@ -258,4 +258,4 @@ Versioning
258
258
The format of fact sets are versioned. This allows the format of fact sets to be modified in the future but still maintain feature stores that
259
259
reference fact sets persisted in an older format.
260
260
261
- A fact set format version is specifed by a ` .version ` file that is stored at the root directory of a given fact set.
261
+ A fact set format version is specified by a ` .version ` file that is stored at the root directory of a given fact set.
0 commit comments