You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is the EMR dedupe geojson job missing options? I am seeing the following errors when trying to run lieu (f55fe8bf232525679baac4c3db1387cb37d16e14) in EMR:
No handlers could be found for logger "mrjob.launch"
Traceback (most recent call last):
File "dedupe_geojson.py", line 204, in <module>
DedupeGeoJSONJob.run()
File "/usr/local/lib/python2.7/site-packages/mrjob/job.py", line 436, in run
mr_job.execute()
File "/usr/local/lib/python2.7/site-packages/mrjob/job.py", line 454, in execute
self.run_spark(self.options.step_num)
File "/usr/local/lib/python2.7/site-packages/mrjob/job.py", line 647, in run_spark
spark_method(input_path, output_path)
File "dedupe_geojson.py", line 137, in spark
use_containing = self.options.use_containing
AttributeError: 'Namespace' object has no attribute 'use_containing'
It's not clear to me if/how/where args are defined or inherited, in part because the use_city = self.options.use_city statement on line 136 does not appear to fail, even it doesn't seem to be defined anywhere either...
No handlers could be found for logger "mrjob.launch"
Traceback (most recent call last):
File "dedupe_geojson.py", line 206, in <module>
DedupeGeoJSONJob.run()
File "/usr/local/lib/python2.7/site-packages/mrjob/job.py", line 436, in run
mr_job.execute()
File "/usr/local/lib/python2.7/site-packages/mrjob/job.py", line 454, in execute
self.run_spark(self.options.step_num)
File "/usr/local/lib/python2.7/site-packages/mrjob/job.py", line 647, in run_spark
spark_method(input_path, output_path)
File "dedupe_geojson.py", line 149, in spark
name_only = self.options.name_only
AttributeError: 'Namespace' object has no attribute 'name_only'
No handlers could be found for logger "mrjob.launch"
Traceback (most recent call last):
File "dedupe_geojson.py", line 213, in <module>
DedupeGeoJSONJob.run()
File "/usr/local/lib/python2.7/site-packages/mrjob/job.py", line 436, in run
mr_job.execute()
File "/usr/local/lib/python2.7/site-packages/mrjob/job.py", line 454, in execute
self.run_spark(self.options.step_num)
File "/usr/local/lib/python2.7/site-packages/mrjob/job.py", line 647, in run_spark
spark_method(input_path, output_path)
File "dedupe_geojson.py", line 184, in spark
with_unit=self.options.with_unit)
TypeError: explain_name_address_dupe() got an unexpected keyword argument 'name_dupe_threshold'
Is the EMR dedupe geojson job missing options? I am seeing the following errors when trying to run lieu (
f55fe8bf232525679baac4c3db1387cb37d16e14
) in EMR:It's not clear to me if/how/where args are defined or inherited, in part because the
use_city = self.options.use_city
statement on line 136 does not appear to fail, even it doesn't seem to be defined anywhere either...The text was updated successfully, but these errors were encountered: