Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Searching references when looking for terms in the pdf files #14

Open
kellijohnson-NOAA opened this issue Mar 1, 2022 · 4 comments
Open
Assignees
Labels
question Further information is requested

Comments

@kellijohnson-NOAA
Copy link
Collaborator

@Bai-Li-NOAA I noticed that unfished|virgin|equilibrium comes up just once in noaa_17252_DS1.pdf so I went to the pdf to see where it occurred and it was in the reference section. Do you think that we should try to eliminate searching the entire document or just not worry about it? The trouble of eliminating the reference section is that it is often in the middle of the document. I vote for just not worrying about it and mentioning it in the Discussion section or something along those lines. But, I wanted to get other's thoughts.

@kellijohnson-NOAA kellijohnson-NOAA added the question Further information is requested label Mar 1, 2022
@chantelwetzel-noaa
Copy link
Collaborator

If omitting the reference section is too difficult, and it could be especially since we will want to scan sections that occur after the references (tables and figures), perhaps we don't worry about it. We could make strategic decisions on how to present the key word search information. If we opt to only use a word cloud, then terms that are only found once or a few will likely not be seen. Alternatively, if we opt to have a table of key terms we could impose a lower bound on items to include that could also deal with this.

I have also been wondering if we should increase the number of assessments summarized. I initially only grabbed a 2-3 from each region thinking we were going to use them to guide us to make decisions on what terms to include in our glossary, but if we want to present this as more of a robust synthesis we may want to add more assessments. What do others think?

@Bai-Li-NOAA
Copy link
Contributor

Bai-Li-NOAA commented Mar 2, 2022 via email

kellijohnson-NOAA added a commit that referenced this issue Mar 2, 2022
remove the numbers of times the words were found b/c we might
change the documents that are included or what portion of those
documents that are searched, i.e., #14.

Remove the reference to SAM

Specify that initial cannot be searched for.

Include rationale on why using unfished.
@chantelwetzel-noaa
Copy link
Collaborator

I went to Stock Smart and pulled additional assessment documents for all Science Centers. The number of assessments for some Science Centers were limited by those available (PIFSC and SWFSC) but I tried to grab a large selection across a range of species. I have added the following files onto the google drive folder ("Assessment Docs"):

AFSC: 16
NWFSC: 18
NEFSC: 16
PIFSC: 8
SEFSC: 22
SWFSC: 4

@Bai-Li-NOAA
Copy link
Contributor

Bai-Li-NOAA commented Mar 3, 2022 via email

@kellijohnson-NOAA kellijohnson-NOAA added this to the Draft Manuscript milestone Mar 4, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

3 participants