This is my simple JoyCaption workflow for captioning images intended for SDXL or FLUX models. It offers a single word-replacement node, as well as a node for appending additional captions or tags to the output.
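For anyone who wants the same post-processing outside the workflow, here is a rough Python sketch of what the two steps amount to. It is not the code behind the actual nodes; `replace_word` and `append_tags` are hypothetical names, and the whole-word, case-insensitive matching is an assumption on my part.

```python
import re


def replace_word(caption: str, old: str, new: str) -> str:
    """Replace whole-word occurrences of `old` with `new`, ignoring case."""
    pattern = re.compile(rf"\b{re.escape(old)}\b", flags=re.IGNORECASE)
    # Use a function as the replacement so backslashes in `new` stay literal.
    return pattern.sub(lambda _match: new, caption)


def append_tags(caption: str, extra: list[str]) -> str:
    """Append extra captions or tags to the caption, comma-separated."""
    parts = [caption.strip().rstrip(",")]
    parts += [tag.strip() for tag in extra if tag.strip()]
    return ", ".join(part for part in parts if part)


caption = "A photo of a woman, woman smiling"
caption = replace_word(caption, "woman", "ohwx woman")
caption = append_tags(caption, ["studio lighting", "4k"])
print(caption)
```

Swapping a generic word for a trigger token and then tacking on shared tags is the typical use when preparing a training set.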
TODO:
Find a way to detect poorly recognized text strings and replace them with known-good matches.
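One possible direction for this, sketched in Python with the standard-library `difflib` module: compare each word of the caption against a list of known-good terms and swap in the closest match when it is similar enough. Nothing here exists in the workflow yet; `correct_terms`, the `known_good` list, and the `0.8` cutoff are all assumptions for illustration, and it only handles single-word terms.

```python
import difflib


def correct_terms(caption: str, known_good: list[str], cutoff: float = 0.8) -> str:
    """Replace words that closely resemble a known-good term with that term."""
    # Map lowercase forms back to the preferred spelling and casing.
    lookup = {term.lower(): term for term in known_good}
    fixed = []
    for word in caption.split():
        # Compare without surrounding punctuation, but keep it in the output.
        core = word.strip(".,;:!?\"'")
        match = difflib.get_close_matches(core.lower(), list(lookup), n=1, cutoff=cutoff)
        if core and match and lookup[match[0]] != core:
            word = word.replace(core, lookup[match[0]], 1)
        fixed.append(word)
    return " ".join(fixed)


print(correct_terms("A sign that reads Cofee Shop.", ["Coffee", "Shop"]))
```

A lower cutoff catches more misreadings but also risks rewriting words that were correct, so the threshold would need tuning against real captions.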
