Update readme -> inference is local (via hugging face), not cloud. Users need to observe gpu / vram. Mention min machine specs.
Update readme -> inference is local (via hugging face), not cloud. Users need to observe gpu / vram. Mention min machine specs.