mastodon.ar.al is one of the many independent Mastodon servers you can use to participate in the fediverse.
This is my personal fediverse server.

Administered by:

Server stats:

1
active users

Aral Balkan

Run Linux, post on the fediverse, and care about accessibility?

You need Frog.

github.com/TenderOwl/Frog#read

Frog extracts text from images.

Use it to copy and paste the text in images into your image descriptions and help make the fediverse more inclusive.

GitHubGitHub - TenderOwl/Frog: Extract text from any image, video, QR Code and etc.Extract text from any image, video, QR Code and etc. - GitHub - TenderOwl/Frog: Extract text from any image, video, QR Code and etc.
#frog#ocr#altText

@aral
So Frog should actually be an integrated part of the Fediverse, either at server or client level. As soon as I upload an image with text, Frog should run and add alt text automagically.

@cmyrland Mastodon has a OCR utility in the edit image feature. I guess it even uses the same backend as Frog, Tesseract.

@aral

@joenepraat @aral Whoa, that's nifty. Didn't know that. And in that case, it should pop up and ask if you want the image to be OCR'ed automagically.

@joenepraat @aral tried the mastodon thing now, and it works really great. It's not 100% perfect with letters outside the english alphabet (at least not norwegian æ ø å), but it's super helpful to get the most of the work done quickly.

@cmyrland It’s not really a replacement for a human being explaining what is in the image to another human being. But it is a useful tool when your image contains the text.

@aral Yes, of course. I don't expect a relatively simple program to be able to describe a picture - not even the most powerful AIs are capable of that yet.

But for OCR-ing text it's very helpful, especially as a screenshot from a webpage can contain lots of text, and you don't always know the source for simple copy/pasting..

@aral thanks it looks really good. I'll try to use it

@tio Looks very neat; thanks :)

@aral
With the rate of current software development, it probably won't be too long before there's a free to use (but sadly likely not open source) online tool which examines an image (and optionally the metadata, e.g. Geotagging) and suggests various lengths / details of descriptions (e.g. depending on where you intend to post the image and the context of accompanying text, people may need to know it's a hanging basket, a hanging basket with red, yellow and blue flowers, or a hanging basket with red petunias, yellow impatiens and blue lobelia, or a hanging basket on Felpersham High Street etc etc). I'd put my nonexistent bets on a certain company famous for its search engine but less famous for being the world's largest online ad broker being (one of?) the first.

@aral

If the developers were well resourced enough to offer a free API, you could potentially see FOSS tools interfacing with it (likely offering to resize and/or strip all EXIF data - if the online tool could generate the description of what's in the image, the FOSS tool could add in geolocation in a privacy respecting way (e.g. Hooking in to any locally stored OSM maps)).