Local dev #27

scotmatson · 2016-03-16T23:35:23Z

Made changes that fixed an error due to the mixture of tabs with white spaces. But the biggest change involved making modifications for adding support for both Python2 and Python3. This included dropping the unicode encoding in html_parser, adding an updated from of the StringIO module, and adding a few additional parenthesis that were missing from various print statements.

I would like to note that I've been testing only the HTML parsing with this pull request. Nothing else should be effected but I am still learning the build of this utility. I've been testing html pages from malware-traffic-analysis.net which is pulling in many FPs - something I plan on playing with the future.

Finally I made the default PDF parser PyPDF2 as it has python2&3 support where pdfminer does not.

… Python2 syntax is now the fallback.

…parse properly from what I can tell and now ioc_parser fully supports Python2 and Python3 - at least for HTML.

scotmatson · 2016-03-17T03:09:12Z

Was reading the Issue response regarding PyPDF2 vs. pdfminer. I understand the reason behind sticking with pdfminer, but feel it would be worthwhile to implementing a solution that addresses Python3 problems out of the box as well.

sjpm · 2016-03-29T16:03:52Z

iocp.py

@@ -35,12 +35,16 @@
 #
 ###################################################################################################

+#from __future__ import unicode_literals


Remnants of a change that was not kept. This line has no real purpose any longer and should not be merged.

scotmatson added 6 commits March 16, 2016 14:31

Removed trailing whitespace.

b91527d

Changed tab into space. Fixed TabError.

3f2304e

Removed trailing whitespace.

8665424

Modified StringIO import statement to support Python3 as the default.…

8bb8982

… Python2 syntax is now the fallback.

Added missing parenthesis on print statements.

aece74a

Removed unicode encoding from html_parser function. HTML files still …

8dc8e7b

…parse properly from what I can tell and now ioc_parser fully supports Python2 and Python3 - at least for HTML.

sjpm reviewed Mar 29, 2016
View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Local dev #27

Local dev #27

scotmatson commented Mar 16, 2016

scotmatson commented Mar 17, 2016

sjpm Mar 29, 2016

Local dev #27

Are you sure you want to change the base?

Local dev #27

Conversation

scotmatson commented Mar 16, 2016

scotmatson commented Mar 17, 2016

sjpm Mar 29, 2016

Choose a reason for hiding this comment