Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

new version pandas.io.common.EmptyDataError: No columns to parse from file #28

Open
mariemorel opened this issue Feb 2, 2022 · 0 comments

Comments

@mariemorel
Copy link

mariemorel commented Feb 2, 2022

Dear Carine,

I have recently updated pcoc with the new version and since then, I cannot run some of my datasets anymore. The sedges c3c4 dataset from Besnard et al still runs but I have a rhodopsin dataset that used to run with the previous version and now I get :
Command error:

  Rate 1
  Distribution...........................: Gamma
  Number of classes......................: 4
  WARNING!!! Parameter Gamma.alpha not specified. Default used instead: 1
  Parameter found........................: Gamma.alpha=1
  - Category 0 (Pr = 0.25) rate..........: 0.136954
  - Category 1 (Pr = 0.25) rate..........: 0.476752
  - Category 2 (Pr = 0.25) rate..........: 1
  - Category 3 (Pr = 0.25) rate..........: 2.38629

  Model 1
  Transition model.......................: LGL08_CAT
  Substitution model.....................: LGL08_CAT_C2
  External model frequencies init........: None

  Model 2
  Transition model.......................: LGL08_CAT
  Substitution model.....................: LGL08_CAT_C10
  External model frequencies init........: None

  Root Frequencies Set 1

  Process 1
  Process type...........................: NonHomogeneous
   Model number1 associated to...........: 1978 node(s).
   Model number2 associated to...........: 2114 node(s).
   Tree number...........................: 1
   Rate number...........................: 1
   Root frequencies number...............: 1

  Phylolikelihood 1
   Data used ............................: 1
   Process ..............................: 1

  Result Phylolikelihood
   Result................................: phylo1

  Killed
  2022-02-03 14:19:03,893 - ERROR - No site retained for lessgappy.align_reroot.fa (too many gaps), you can use the "--max_gap_allowed" option.
  Traceback (most recent call last):
    File "/usr/local/bin/pcoc_det.py", line 703, in <module>
      mk_detect(tree_filename, ali_basename, OutDirName)
    File "/usr/local/bin/pcoc_det.py", line 512, in mk_detect
      df_res_l = pool.map(make_estim, set_e1e2)
    File "/usr/lib/python2.7/multiprocessing/pool.py", line 251, in map
      return self.map_async(func, iterable, chunksize).get()
    File "/usr/lib/python2.7/multiprocessing/pool.py", line 567, in get
      raise self._value
  pandas.io.common.EmptyDataError: No columns to parse from file

my command :

S=`cat scenario_pcoc_noacr.txt` 
pcoc_det.py -t tree-rerooted.nhx.txt -aa lessgappy.align_reroot.fa.txt -cpu 4 -o output_pcoc -m $S -f 0.8 -f_oc 0.5 --max_gap_allowed 20 -est_profiles C10

I have tried to discard as many gaps as possible but I don't think it is the issue here. I analyzed my fasta file and the tree and I really cannot see the problem.

In the meanwhile, I am using the old version.

Thank you a lot for your answer and your work!

lessgappy.align_reroot.fa.txt
scenario_pcoc_noacr.txt
tree_rerooted.nhx.txt

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant