diff --git a/assets/images/parsing-banner.png b/assets/images/parsing-banner.png new file mode 100644 index 00000000..d7a94ea6 Binary files /dev/null and b/assets/images/parsing-banner.png differ diff --git a/content/en/blog/doxa-parsing.md b/content/en/blog/doxa-parsing.md new file mode 100644 index 00000000..bcccde68 --- /dev/null +++ b/content/en/blog/doxa-parsing.md @@ -0,0 +1,73 @@ +--- +title: "Improve Harmony's PDF parsing on DOXA AI" +categories: + - "ai-in-research" +image: "/images/parsing-banner.png" +date: 2024-12-20 +url: "/doxa-parsing/" +--- + +## Train your own Large Language Model to parse PDFs and win up to £1000 in vouchers! + +*Join a competition to train a Large Language Model to improve Harmony's PDF parsing. You don't need to have trained a Large Language Model before.* + +{{< grid columns="2" >}} + {{< card heading="Register on DOXA AI" copy="Enter the competition on DOXA AI by fine tuning your own large language model and improve Harmony!" url="https://doxaai.com/competition/harmony-parsing" >}} + {{< card heading="Join our Discord" copy="Join the Harmony Discord server. Check out the 🏅「matching-challenge」 channel!" url="https://discord.com/invite/harmonydata" >}} +{{< /grid >}} + + + + +We would like to improve Harmony's PDF parsing algorithm. + + +We would like to improve Harmony with a *fine tuned* large language model. We have teamed up with DOXA AI and made an online competition where you can improve on the off-the-shelf LLMs which we are currently using. You can win up to £1000 in vouchers! [Click here to join the Harmony matching competition on DOXA AI](https://doxaai.com/competition/harmony-parsing). + + +{{< grid columns="2" >}} + {{< card heading="Register on DOXA AI" copy="Enter the competition on DOXA AI by fine tuning your own large language model and improve Harmony!" url="https://doxaai.com/competition/harmony-parsing" >}} + {{< card heading="Join our Discord" copy="Join the Harmony Discord server. Check out the 🏅「matching-challenge」 channel!" url="https://discord.com/invite/harmonydata" >}} +{{< /grid >}} + + + +## What about data? + +We have gathered training data for you to use to fine tune your model, and there is unseen validation data which we will use to score the model. + +[More information is available on the DOXA AI page](https://doxaai.com/competition/harmony-parsing). + +## How can I get started? + +First, [create an account on DOXA AI and enroll in the competition](https://doxaai.com/competition/harmony-parsing) and download the code examples and training data. + +## Prizes + +The prize for the winner of the competition is £1000 in vouchers and the runner up will get £500 in vouchers. + +## See also + +[Matching competition](/doxa/) + +## See other events + +* 22 November 2024: [Harmony at Women In Data™️ London Chapter](/open-source-for-social-science/women-in-data/) +* 8 October 2024: [Harmony: a free online tool using LLMs for research in psychology and social sciences](/psychology-ai-tool/aidl-meetup/) at AI|DL London +* 11 and 12 September 2024: [Harmony at MethodsCon Futures](/ai-in-mental-health/harmony-at-methodscon-futures/ +) in Manchester +* 2 July 2024: [Harmony: NLP and generative models for psychology research](/open-source-for-social-science/pydata-meetup/) at Pydata London +* 3 June 2024: [Harmony Hackathon](/open-source-for-social-science/hackathon/) at UCL +* 5 May 2024: [Harmony: A global platform for harmonisation, translation and cooperation in mental health](/ai-in-mental-health/harmony-at-lifecourse-seminar/) at Melbourne Children’s LifeCourse Initiative seminar series. +* 27 March 2024: [Harmony at AI Camp](/psychology-ai-tool/aicamp-meetup/) +* 17 August 2023: [Harmony and TIDAL workshop](/ai-in-mental-health/harmony-and-tidal-workshop) + + + +{{< htmlcode >}} + + + +{{< /htmlcode >}} + +{{< card heading="Register on DOXA AI" copy="Enter the competition on DOXA AI by fine tuning your own large language model and improve Harmony!" url="https://doxaai.com/competition/harmony-parsing" >}}