tlug.jp Mailing List Archive
Re: [tlug] Anyone alive out here ?
- Date: Sun, 8 Sep 2024 12:12:29 +0900
- From: Chris Salisbury <chris.salisbury@example.com>
- Subject: Re: [tlug] Anyone alive out here ?
- References: <f03af814-4ec7-6b38-59fb-3c1851faf7a8@gmail.com> <2d9532be-b1af-42d1-a8cd-8eae13f9f9d5@codewiz.org> <f48593f1-8a3a-440d-9b29-33199b6dcef2@dcook.org> <f6e0bde9-1926-495e-a8c3-14e80d7a0203@codewiz.org>
>>> What's the current state of the art? Interfacing models with text has big limits...
>> Quite closely related, I've been wondering what the state of the art for open-source OCR is, particularly of Japanese text.
> I'm waiting for the first Llama-like LLM with image recognition similar to ChatGPT.

Not sure if "similar to ChatGPT" rules out the much worse performance of the "Llama-like" models at https://ollama.com/search?c=vision, but I've been impressed with llava-phi3, given that it is small enough to run on a phone I got for free with a 2-month contract 3 years ago. I run it on (unrooted) Android 11 via Termux -> proot -> ollama. I've never tried OCR with it, though, Japanese or otherwise.
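For anyone who wants to try the OCR question against that kind of setup, here is a minimal sketch, not a tested recipe. It assumes ollama is serving on its default local port (11434), that the model has already been fetched with `ollama pull llava-phi3`, and that the image file name and the prompt are placeholders of your own choosing. The helper names `build_payload` and `ask_about_image` are made up for illustration. How well the model transcribes Japanese text is unknown here.

```python
import base64
import json
import urllib.request

# Assumption: ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_payload(image_bytes, prompt, model="llava-phi3"):
    """Build the JSON body for one non-streaming request to a vision model."""
    return {
        "model": model,
        "prompt": prompt,
        # The generate endpoint takes images as base64-encoded strings.
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,
    }


def ask_about_image(image_path, prompt, model="llava-phi3", url=OLLAMA_URL):
    """Send an image plus a prompt to ollama and return the model's text reply."""
    with open(image_path, "rb") as f:
        payload = build_payload(f.read(), prompt, model)
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]
```

With the server running, something like `print(ask_about_image("scan.png", "Transcribe any text in this image."))` would print whatever the model makes of the image; `scan.png` is a hypothetical file name. Only the standard library is used, so nothing extra needs installing inside the proot environment.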
- Earlier messages in this thread:
  - Re: [tlug] Anyone alive out here ? (From: J. Hart)
  - Re: [tlug] Anyone alive out here ? (From: Bernie Innocenti)
  - Re: [tlug] Anyone alive out here ? (From: Darren Cook)
  - Re: [tlug] Anyone alive out here ? (From: Bernie Innocenti)