diff --git a/src/prompts/parsePDF.ts b/src/prompts/parsePDF.ts index df34865c..35c54c13 100644 --- a/src/prompts/parsePDF.ts +++ b/src/prompts/parsePDF.ts @@ -5,12 +5,11 @@ I have a text extracted from a PDF file containing a company's annual report and 2. **Handling Units**: Pay close attention to the units and handle them correctly. If emissions are reported in thousands of metric tons (x1,000 ton CO2e), make this clear. Mt CO2e means million ton CO2e. If the figures are on a different scale, such as millions of tons (x1,000,000 ton CO2e), note this but never try to convert units. Also look for any side notes or footnotes that may explain the units. Be very attentive to whether the unit is metric tons (tonnes) or US tons. Always present the data in json even if there are disclaimers in the footnotes. -3. **Data Output Format**: Present the extracted data in a structured JSON format. Include the year, Scope 1, Scope 2, Scope 3, and total emissions for each year. If possible, also include the company's name and organization number in the JSON structure. +3. **Data Output Format**: Present the extracted data in a structured JSON format. Include the year, Scope 1, Scope 2, Scope 3, and total emissions for each year. Example JSON structure: { "companyName": "Example Company", - "organizationNumber": "123456789", "emissions": [ { "year": "2019",