Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AUtoML dataset #2

Open
priyanga24 opened this issue May 20, 2020 · 0 comments
Open

AUtoML dataset #2

priyanga24 opened this issue May 20, 2020 · 0 comments

Comments

@priyanga24
Copy link

Hi Shashank,
I tried to use AutoML with a dataset that contains features extracted from PE files.
image
This is how basically my dataset will look like. I ran this with autogluon and it gave me good results. Is it true that autogluon can process datasets like these?
The one shown below is the data in one single cell under a column name 'datadir'. like these there are many rows and there are many other columns too which have subset of data like these.
[{'name': 'EXPORT_TABLE', 'size': 0, 'virtual_address': 0}, {'name': 'IMPORT_TABLE', 'size': 200, 'virtual_address': 35312}, {'name': 'RESOURCE_TABLE', 'size': 28672, 'virtual_address': 352256}, {'name': 'EXCEPTION_TABLE', 'size': 0, 'virtual_address': 0}, {'name': 'CERTIFICATE_TABLE', 'size': 0, 'virtual_address': 0}, {'name': 'BASE_RELOCATION_TABLE', 'size': 0, 'virtual_address': 0}, {'name': 'DEBUG', 'size': 0, 'virtual_address': 0}, {'name': 'ARCHITECTURE', 'size': 0, 'virtual_address': 0}, {'name': 'GLOBAL_PTR', 'size': 0, 'virtual_address': 0}, {'name': 'TLS_TABLE', 'size': 0, 'virtual_address': 0}, {'name': 'LOAD_CONFIG_TABLE', 'size': 72, 'virtual_address': 35240}, {'name': 'BOUND_IMPORT', 'size': 0, 'virtual_address': 0}, {'name': 'IAT', 'size': 660, 'virtual_address': 32768}, {'name': 'DELAY_IMPORT_DESCRIPTOR', 'size': 0, 'virtual_address': 0}, {'name': 'CLR_RUNTIME_HEADER', 'size': 0, 'virtual_address': 0}]
When i tried to run autogluon itself created many features and gave me good accuracy.
Is this working efficiently and correct? or i should change my data? Because autogluon's important characteristic is that it will itself take care of dataset like these. Please respond asap.thank you

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant