what authors are in the catalog that aren't in the AAE spreadsheet? #61

Open
cwulfman opened this issue Jan 22, 2018 · 20 comments

Comments

@cwulfman

https://docs.google.com/spreadsheets/d/1RHN6KBulDGbpKATLU6PtwU4o5xVsaBB6xbQRtKjMyWE/edit?usp=sharing

@cwulfman
Author

Here they are: there are 464 of them.
missing.xml.zip
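
For readers who want to reproduce this kind of comparison, here is a minimal sketch of a set difference between catalog ids and spreadsheet ids. It is not the script that produced the attachment: the function names, the normalization step, and the `tlg…` sample ids are all hypothetical.

```python
def normalize(identifier: str) -> str:
    """Trim whitespace and lowercase so trivially different spellings compare equal."""
    return identifier.strip().lower()


def missing_from_spreadsheet(catalog_ids, spreadsheet_ids):
    """Return catalog ids that have no counterpart in the spreadsheet, sorted."""
    known = {normalize(i) for i in spreadsheet_ids}
    return sorted({normalize(i) for i in catalog_ids} - known)


# Hypothetical sample data, for illustration only.
catalog = ["tlg0001", "TLG0002 ", "tlg0003"]
spreadsheet = ["tlg0001", "tlg0003"]
print(missing_from_spreadsheet(catalog, spreadsheet))
```

Normalizing both sides before comparing matters here, since an id that differs only in case or stray whitespace would otherwise be reported as missing.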

@AlisonBabeu

Thanks so much for this, @cwulfman. I plan to add them all to the spreadsheet.

@cwulfman
Author

Let me know when they're in there!

@cwulfman
Author

Oops. Those are just the Greek authors. Let me run the Latin authors, too...

@cwulfman
Author

Here are the missing Latin authors.

missing_latin.xml.zip

@AlisonBabeu

Hi @cwulfman, I think something may have gone a bit wonky with the Latin authors: I'm finding many of the authors in this list in the AAE spreadsheet (e.g. Acholius, Actorius Naso). In fact, as I go through the list, it seems that many of them can be found on the Latin spreadsheet.

@cwulfman
Author

Hmm. Not surprising. I'll look at it.

@cwulfman
Author

@AlisonBabeu While tracking this down, I've discovered a couple of wonky phi ids:

Novius (author.1000.1) has <mads:identifier type="phi">592.1</mads:identifier> (And the related-work id looks wrong, too)

Hirtius (author.744.1) has <mads:identifier type="phi">530.1</mads:identifier>

@AlisonBabeu

Done and done.

@AlisonBabeu

Fixed that is. Sigh.

@cwulfman
Author

Thanks. I'll get back to this this afternoon.

@cwulfman
Author

@AlisonBabeu The STOA field in the Latin Authors spreadsheet is considerably dirtier than the TLG column in the Greek Authors sheet. I'll keep tweaking my scripts to work around it, but I thought you might want to know: there are tons of "invalid ids" in there (i.e., those that don't follow the "stoaN+-stoaN+" pattern). I've attached the XML version, which makes the discrepancies easier to spot using Oxygen.

[LatinAuthors.xml.zip](https://github.com/PerseusDL/perseus_catalog/files/1658023/LatinAuthors.xml.zip)
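
To make the "invalid id" test concrete, here is a minimal sketch of a check against the "stoaN+-stoaN+" pattern. The regular expression is one assumed reading of that pattern, the function names are hypothetical, and `stoa0001-stoa001` is a made-up well-formed id used only for illustration; this is not the actual script.

```python
import re

# Assumed reading of "stoaN+-stoaN+": "stoa" plus digits, a hyphen,
# then "stoa" plus digits. Anything else counts as invalid.
STOA_WORK_ID = re.compile(r"stoa\d+-stoa\d+")


def is_valid_stoa_id(value: str) -> bool:
    """Return True if the whole value is a textgroup-work STOA id."""
    return STOA_WORK_ID.fullmatch(value.strip()) is not None


def invalid_ids(values):
    """Return the values that do not follow the pattern, in input order."""
    return [v for v in values if not is_valid_stoa_id(v)]


# Illustrative values: one well-formed id and several that fail the check.
sample = ["stoa0001-stoa001", "stoa0324", "n.a.", "516.1", ""]
print(invalid_ids(sample))
```

Using `fullmatch` rather than `search` means a value with extra characters before or after a well-formed id is still reported as invalid.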

@AlisonBabeu

Hi @cwulfman, I'm currently going through the list in Oxygen. As one type of example, do you mean the various STOA IDs with letters in them, like "stoa0022-stoa055m"? Those are invalid in a sense, I suppose. I used letters in a number of cases to slot works alphabetically into an author's list of works (such as Augustine's), but then found I couldn't do it consistently. I should have just kept adding works with numbers and no letters, but this is of course the problem when one person creates all the IDs.

@cwulfman
Author

No, I mean ids like these:

1
1
516
966
a
a
a
a
a
a
a
a
a
a
n
n
n
n
n
n
n
n
n
n
none?
stoa001
stoa0028a
stoa0032a
stoa0101a-sto001
stoa0101a-sto002
stoa0196c-sto002
stoa0255-various
stoa0304
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0324
stoa0327
stoa0340-unassigned
stoa0341-unassigned
stoa0342
stoa0344
stoa0344
stoa0361a

@perseuscatalog
Contributor

perseuscatalog commented Jan 24, 2018 via email

@AlisonBabeu

In a few cases these can just be ignored: when there is a top-level STOA ID without a separate work-level ID, that is often because there isn't one. For example, stoa0324 is a top-level ID (in this case for the Scriptores Historiae Augustae) with no individual work-level IDs, just a top-level textgroup. For many works/textgroups that had individual PHI IDs, I didn't bother to create individual STOA work-level IDs as well.

@AlisonBabeu

So, hi @cwulfman, I've gone through the list both in Oxygen and in a Google Sheet, and I can't find any "n" or "1" or "a" in any of the STOA ID fields. Am I missing something?

@cwulfman
Author

I was doing some pre-processing on those ids, and my pre-processing chopped things up wrong; sorry about that. Here's an improved list:

<STOA_>stoa0361a</STOA_>
<STOA_>stoa0028a.stoa001</STOA_>
<STOA_>stoa0032a</STOA_>
<STOA_>n.a.</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0304</STOA_>
<STOA_>stoa0344</STOA_>
<STOA_>stoa0101a-sto001</STOA_>
<STOA_>stoa0101a-sto002</STOA_>
<STOA_>516.1</STOA_>
<STOA_>n.a.</STOA_>
<STOA_>n.a.</STOA_>
<STOA_>n.a.</STOA_>
<STOA_>n.a.</STOA_>
<STOA_>n.a.</STOA_>
<STOA_>n.a.</STOA_>
<STOA_>n.a.</STOA_>
<STOA_>n.a.</STOA_>
<STOA_>stoa0327</STOA_>
<STOA_/>
<STOA_/>
<STOA_/>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0196c-sto002</STOA_>
<STOA_>none?</STOA_>
<STOA_>966.1</STOA_>
<STOA_/>
<STOA_/>
<STOA_>stoa0340-unassigned</STOA_>
<STOA_>stoa0341-unassigned</STOA_>
<STOA_>stoa0342</STOA_>
<STOA_/>
<STOA_/>
<STOA_/>
<STOA_>stoa0344</STOA_>
<STOA_/>
<STOA_>stoa0255-various</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_/>
<STOA_>n.a.</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
<STOA_>stoa0324</STOA_>
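
A list like this can be summarized by reading the `<STOA_>` elements straight from the XML and sorting each value into a rough category. The sketch below is only an illustration: the `<rows>` wrapper element, the category names, and the three regular expressions (including the guess that values like `516.1` are PHI-style ids) are assumptions, not part of the actual export or scripts.

```python
import re
import xml.etree.ElementTree as ET
from collections import Counter

WORK_ID = re.compile(r"stoa\d+-stoa\d+")     # textgroup-work id
TEXTGROUP_ID = re.compile(r"stoa\d+[a-z]?")  # textgroup only, optional letter suffix
PHI_LIKE = re.compile(r"\d+\.\d+")           # assumed PHI-style value, e.g. 516.1


def classify(text):
    """Assign one STOA_ value to a rough category."""
    value = (text or "").strip()  # <STOA_/> has text None
    if not value:
        return "empty"
    if WORK_ID.fullmatch(value):
        return "work"
    if TEXTGROUP_ID.fullmatch(value):
        return "textgroup-only"
    if PHI_LIKE.fullmatch(value):
        return "phi-like"
    return "other"


def tally(xml_text):
    """Count categories over every STOA_ element in the document."""
    root = ET.fromstring(xml_text)
    return Counter(classify(el.text) for el in root.iter("STOA_"))


# Values taken from the list above, wrapped in a hypothetical <rows> root.
sample = """<rows>
  <STOA_>stoa0324</STOA_>
  <STOA_>stoa0361a</STOA_>
  <STOA_>516.1</STOA_>
  <STOA_>n.a.</STOA_>
  <STOA_/>
  <STOA_>stoa0101a-sto001</STOA_>
</rows>"""
print(tally(sample))
```

Separating "textgroup-only" from "other" keeps entries like stoa0324 apart from placeholders such as "n.a." and malformed ids such as "stoa0101a-sto001", which need different handling.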

@AlisonBabeu

Hi @cwulfman, I think I fixed all of these. I've changed n.a. to none, fixed a few places where a PHI was in STOA, deleted a few empty rows causing issues, and added work-level STOA IDs where relevant. Can you just ignore stoa0324? I don't want to number all of those individual works since they have PHIs. Thanks!

@cwulfman
Author

Sure thing. Thanks!

3 participants