Add an implementation of HF model and an example sentiment analysis a… #194
Conversation
Thanks a lot for starting this; it will be very good to have in the package. I left a few comments, and have some higher-level questions:
- Is this tied to a specific class of models? I see in the Inference docs (https://huggingface.co/docs/api-inference/detailed_parameters) that the parameters differ per task type. If not, let's leave a comment (perhaps module-level) that tells users what they are expected to put in the "prompt fn".
- Currently, there is no "inputs" key in the payload we send to the API, while all the samples in their docs have that key; maybe "text" was an older API that is no longer official? (See the payload sketch after this comment.)
- Perhaps we can make things even clearer by changing the name of the model to HuggingFaceInferenceAPIModel. I know that's quite a mouthful, but there are HuggingFace Inference Endpoints that are paid, so we should disambiguate from those (and also from general HuggingFace models that run locally).
Let me know what you think and if something is unclear!
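For reference, a minimal sketch of the payload shape the Inference API documentation describes for a text-classification call; the model name and token below are placeholders, not the ones used in this PR:

```python
import requests

# Placeholder model and token, for illustration only.
API_URL = "https://api-inference.huggingface.co/models/distilbert-base-uncased-finetuned-sst-2-english"
headers = {"Authorization": "Bearer hf_xxx"}

# The documented payload uses an "inputs" key; the code in this PR currently sends "text".
response = requests.post(API_URL, headers=headers, json={"inputs": "I like this package a lot."})
print(response.json())  # e.g. a list of {"label": ..., "score": ...} entries per input
```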
Indeed, different task types have different input/output formats and types; I only tried an example of the classification task.
Interesting. I did try that but "inputs" does not work, at least for the model I used in the asset. It returns
Sure!
Thanks a lot for the changes; the only big thing I'm worried about is the mismatched APIs ("inputs" in the docs vs "text" that seems to work for us). We should get this committed nevertheless, but if you have some time, let's try to dig into this and see why it is happening.
llmebench/models/HuggingFace.py (outdated)
)
if not response.ok:
    if response.status_code == 503:  # model loading
        time.sleep(1)
Any particular reason for the sleep here?
Hoping to give the model some time to load before retrying?
The retry mechanism has an inherent random delay (that gets sampled from an exponentially increasing range), so we don't need to worry about it here
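To make the point concrete, here is a generic sketch of retry delays sampled from an exponentially growing range with random jitter; this illustrates the idea, it is not the package's actual retry code:

```python
import random
import time

def backoff_delay(attempt, base=1.0, cap=60.0):
    """Random delay drawn from a range that doubles with each attempt, capped at `cap` seconds."""
    return random.uniform(0, min(cap, base * (2 ** attempt)))

# Example: sleep before each of the first few retries.
for attempt in range(4):
    delay = backoff_delay(attempt)
    print(f"retry {attempt}: sleeping {delay:.2f}s")
    time.sleep(delay)
```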
assets/benchmark_v1/sentiment/sentiment/ArSASSentiment_HF_ZeroShot.py (outdated; resolved)
The returned format does not include the original text, and the dataset ground truth consists of labeled sentences, which cannot be recovered from the model output alone.
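For context, a text-classification response from the Inference API typically looks like the sketch below (an assumed shape; the exact labels depend on the model), which is why the original sentence has to be tracked on our side rather than recovered from the output:

```python
# Assumed response shape for a classification model; the input text is not echoed back.
response_json = [[{"label": "Positive", "score": 0.91},
                  {"label": "Negative", "score": 0.06},
                  {"label": "Neutral", "score": 0.03}]]

# Pair predictions with the dataset rows by position, since the output alone
# cannot tell us which sentence it belongs to.
inputs = ["an example sentence from the dataset"]
predictions = [max(scores, key=lambda s: s["score"])["label"] for scores in response_json]
print(list(zip(inputs, predictions)))  # [('an example sentence from the dataset', 'Positive')]
```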