More sophisticated functionality for generating text from images and apply prompts to images #27

C-Loftus · 2024-03-03T01:45:34Z

Describe images using custom prompts and more sophisticated TTS integration for helping users who are blind or for those that do lots of UI design

~~Have to figure out how to convert files on the clipboard to squares since the openai api only works on square images~~
~~Have to figure out file uploading and deleting since applying edits to the image needs to be done on stored images. Want to make sure that storing images isn't going to cause weird issues with billing~~

… and apply prompts to images

for more information, see https://pre-commit.ci

jaresty · 2024-03-03T03:35:14Z

Images/descriptionPrompt.talon-list

+
+-
+
+structure: Output nothing but your best approximation for the raw semantic HTML without any styling that could be used to create the user interface in the following image. Do not include any boilerplate HTML.


I love this 😀

I'd also like to see a prompt to generate some reasonable component names (I.e. if you were to build this in react, how might you break it up?)

for more information, see https://pre-commit.ci

C-Loftus · 2024-03-05T17:06:16Z

@jaresty When you get a chance maybe you could test this out. I want to improve the CSS styling on the image output description. If you have any bandwidth, improving the CSS for the HTML builder would be the main thing I would like to improve. I have moved it to a new file.

Essentially image description works pretty well and we have two new settings for whether or not we want to open the description in a new webpage and how much content we wanted to describe back.

I think we have most of the functionality here just thinking about improving UX

I think image generation (beyond simple prompting) or any sort of image editing is not really worthwhile at the moment since it only returns square images and we also have to worry about managing file uploads to openai which is sort of beyond the scope of this PR. Image generation really needs a more proper UI/GUI to be done well I think

C-Loftus · 2024-03-06T18:55:23Z

Think the css should be fixed. If this looks good, it should be ready to merge. UX can continue to be improved, but it is generally satisfactory and good to iterate on

C-Loftus and others added 2 commits March 2, 2024 20:43

Some more sophisticated functionality for generating text from images…

2a70b74

… and apply prompts to images

[pre-commit.ci] auto fixes from pre-commit.com hooks

342647a

for more information, see https://pre-commit.ci

C-Loftus changed the title ~~Sophisticated functionality for generating text from images and apply prompts to images~~ More sophisticated functionality for generating text from images and apply prompts to images Mar 3, 2024

jaresty reviewed Mar 3, 2024

View reviewed changes

C-Loftus and others added 2 commits March 5, 2024 12:01

lots of image improvements

9720210

[pre-commit.ci] auto fixes from pre-commit.com hooks

9eb9a8c

for more information, see https://pre-commit.ci

C-Loftus added 2 commits March 6, 2024 13:52

fix css

d8e9263

better style readin in dynamically

ab43f64

Merge branch 'main' into imageChanges

9177e7b

C-Loftus merged commit a0498fc into main Mar 6, 2024
2 checks passed

C-Loftus deleted the imageChanges branch March 6, 2024 19:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More sophisticated functionality for generating text from images and apply prompts to images #27

More sophisticated functionality for generating text from images and apply prompts to images #27

C-Loftus commented Mar 3, 2024 •

edited

Loading

jaresty Mar 3, 2024

jaresty Mar 3, 2024

C-Loftus commented Mar 5, 2024 •

edited

Loading

C-Loftus commented Mar 6, 2024 •

edited

Loading


		-

		structure: Output nothing but your best approximation for the raw semantic HTML without any styling that could be used to create the user interface in the following image. Do not include any boilerplate HTML.

More sophisticated functionality for generating text from images and apply prompts to images #27

More sophisticated functionality for generating text from images and apply prompts to images #27

Conversation

C-Loftus commented Mar 3, 2024 • edited Loading

jaresty Mar 3, 2024

Choose a reason for hiding this comment

jaresty Mar 3, 2024

Choose a reason for hiding this comment

C-Loftus commented Mar 5, 2024 • edited Loading

C-Loftus commented Mar 6, 2024 • edited Loading

C-Loftus commented Mar 3, 2024 •

edited

Loading

C-Loftus commented Mar 5, 2024 •

edited

Loading

C-Loftus commented Mar 6, 2024 •

edited

Loading