-
Notifications
You must be signed in to change notification settings - Fork 6
/
design.txt
70 lines (49 loc) · 1.76 KB
/
design.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
Flow
init - collect all files with name <source>, write it into a single file
device identification - add device info as the first keyword in the logline
preprocess - slice log files with more than <linePerBatch> into several files
param detection - process each file through LogPai to detect parameters and log templates
templates building
if (template is not recognized)
param identification - attemps to recognize parameters produced by logpai with stanford NLP and regex
NER rules building - read config and create rules for Stanford NLP
template KG building - represent the log templates in turtle and (manually) add meaning to
ottr template - represent the log templates as OTTR
ottr building
ottr instance - conversion of log line
ttl conversion - running lutra engine to convert ottr instances
Config
# general
sourceFolder: root folder where we want to search for source files
source: file that we want to process e.g., sys.log
linePerBatch: number of lines that we want to process per batch (bigger files will be splitted)
isOverride: true if we should remove existing data
# logpai
logpaiFormat: # e.g., "<Device> <Month> <Date> <Time> <Type> <Component>: <Content>"
# template building
paramExtractAttempt: # how many log events will used as sample for identify parameter type
Logpai
logFormat:
================
executable
output
<source>
1-init
<source>.[i]
2-logpai
<source>.[i]_structured.csv
<source>.[i]_templates.csv
<source>.error
3-ottr
<source>.stottr
<source>.[i].ottr
4-turtle
<source>.error
<source>.instances.ttl
<source>.ner.rules
<source>.templates.ttl
timer.csv
src
main
java
resources