-
Notifications
You must be signed in to change notification settings - Fork 81
/
__init__.py
110 lines (96 loc) · 3.55 KB
/
__init__.py
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
"""This is the 3W Toolkit, a software package written in Python 3 that
is one of the 3W Project's major components.
This toolkit contains resources that make the following easier:
- 3W Dataset overview generation;
- Experimentation and comparative analysis of Machine Learning-based
approaches and algorithms for specific problems related to undesirable
events that occur in offshore oil wells during their respective
production phases;
* Standardization of key points of the Machine Learning-based algorithm
development pipeline.
All these resources are implemented in the following sub-modules:
- **base**: groups the objects used by the other sub-modules;
- **dev**: has all the resources related to development of Machine
Learning models;
- **misc**: brings together diverse resources that do not fit in the
other sub-modules;
- **rolling_window**: creates a view of array which for every point
gives the n-dimensional neighbourhood of size window. New dimensions are
added at the end of array or after the corresponding original dimension.
Specific problems will be incorporated into this toolkit gradually. At
this time, models can be developed for the following problems:
- Binary Classifier of Spurious Closure of DHSV.
Examples of how to use this toolkit will be incremented throughout its
development. Please, check the 3W Project's README.md file for more details.
It is important to note that there are arbitrary choices in this
toolkit, but they have been carefully made to allow adequate comparative
analysis without compromising the ability to experiment with different
approaches and algorithms.
This toolkit's documentation is generated in english and in Google format
with [autoDocstring - Python Docstring Generator
](https://github.com/NilsJPWerner/autoDocstring), which follows [PEP 257
](https://peps.python.org/pep-0257/), and [pdoc3
](https://pdoc3.github.io/pdoc/).
Its source code is implemented according to the style guide established
by [PEP 8](https://peps.python.org/pep-0008/). This is guaranteed with
the use of the [Black formatter](https://github.com/psf/black).
"""
__status__ = "Development"
__version__ = "1.1.0"
__license__ = "Apache License 2.0"
__copyright__ = "Copyright 2024, Petróleo Brasileiro S.A."
__authors__ = [
"Ricardo Emanuel Vaz Vargas",
"Bruno Guberfain do Amaral",
"Jean Carlos Dias de Araújo",
"Lucas Pierezan Magalhães",
]
__maintainer__ = ["Ricardo Emanuel Vaz Vargas <[email protected]>"]
# Imports and exposes objects from the 'base' sub-module
from .base import (
CLASS,
COLUMNS_DATA_FILES,
COLUMNS_DESCRIPTIONS,
DATASET_INI,
DATASET_VERSION,
EVENT_NAMES,
EVENT_NAMES_DESCRIPTIONS,
EVENT_NAMES_LABELS,
EVENT_NAMES_OBSERVATION_LABELS,
EXTRA_INSTANCES_TRAINING,
EventType,
LABELS_DESCRIPTIONS,
NORMAL_LABEL,
PARQUET_EXTENSION,
PARQUET_ENGINE,
PARQUET_COMPRESSION,
PATH_3W_PROJECT,
PATH_DATASET,
PATH_DATASET_INI,
PATH_FOLDS,
PATH_TOOLKIT,
TRANSIENT_OFFSET,
VARS,
load_config_in_dataset_ini,
)
# Imports and exposes objects from the 'dev' sub-module
from .dev import (
Experiment,
EventFold,
EventFolds,
)
# Imports and exposes objects from the 'misc' sub-module
from .misc import (
calc_stats_instances,
count_properties_instance,
count_properties_instances,
create_and_plot_scatter_map,
create_table_of_instances,
filter_rare_undesirable_events,
get_all_labels_and_files,
label_and_file_generator,
load_instance,
load_instances,
resample,
plot_instance,
)