há 1 mês atrás · c6bcbba921
--- a/README.md
+++ b/README.md
@@ -1,6 +1,6 @@
 
				 # 🖼️ Ollama Image Captionizer
			
 
				 
			
 
				-A Python script that uses a local [Ollama](https://ollama.com/) multimodal model to generate captions for your images. It features a rich, interactive terminal user interface (TUI) for easy operation, configuration, and live progress tracking. This is mainly a tool for preparing image datasets for training with FLUX. They are captions, as unlike Stable Diffusion, FLUX relies on natural language processing over keyword processing.
			
 
				+A Python script that uses a local [Ollama](https://ollama.com/) multimodal model to generate captions for your images in bulk. You can use the prompt to guide the vision model to include certain keywords, to describe a certain person by their name. It features a rich, interactive terminal user interface (TUI) for easy operation, configuration, and live progress tracking. This is mostly a helper tool for preparing image datasets for training with FLUX. They are captions, as unlike Stable Diffusion, FLUX relies on natural language processing over keyword processing.
			
 
				 
			
 
				 ![A MacOS iTerm2 window of the Ollama Image Captionizer working its magic through the moondream model](https://images.mitch.science/i/a7710cef-ea63-4206-a5ee-7aee0d244901.jpg)