I’m happy to introduce TextEvolve, a system that iteratively generates and tests new programs over your dataset, evolving its approach with LLM evaluation.
TextEvolve changes program flow, tries new ideas, and outputs optimized programs in the form of Python scripts.
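To make the generate-and-test idea concrete, here's a toy sketch of that kind of loop. This is not TextEvolve's actual code or API: `propose`, `score`, and `evolve` are hypothetical names, the "programs" are plain Python callables, and the LLM step is replaced by a stub that cycles through a fixed list of candidates. Scoring is simple exact match, which is also an assumption.

```python
from typing import Callable, List, Tuple

Program = Callable[[str], str]
Dataset = List[Tuple[str, str]]

# Hypothetical candidate pool; a real system would have an LLM write new programs.
CANDIDATES: List[Program] = [str.lower, str.title, str.strip, str.upper]


def score(program: Program, dataset: Dataset) -> float:
    """Fraction of examples where the program's output matches the expected answer."""
    correct = sum(1 for question, answer in dataset if program(question) == answer)
    return correct / len(dataset)


def propose(step: int) -> Program:
    """Stub for the LLM proposal step (assumption): cycle through the fixed pool."""
    return CANDIDATES[step % len(CANDIDATES)]


def evolve(dataset: Dataset, iterations: int = 8) -> Tuple[Program, float]:
    """Keep the best-scoring program seen so far across all iterations."""
    best_program: Program = lambda text: text  # baseline: return the input unchanged
    best_score = score(best_program, dataset)
    for step in range(iterations):
        candidate = propose(step)
        candidate_score = score(candidate, dataset)
        # Only replace the incumbent when the candidate is strictly better.
        if candidate_score > best_score:
            best_program, best_score = candidate, candidate_score
    return best_program, best_score


dataset = [("hello", "HELLO"), ("text evolve", "TEXT EVOLVE"), ("Abc", "ABC")]
best_program, best_score = evolve(dataset)
print(best_score)
```

In this sketch the loop only keeps a single best candidate. A fuller version would presumably feed earlier attempts and their scores back into the proposal step, which the stub here ignores.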
Initial results are very strong! TextEvolve produces programs that outperform the base model and SoTA methods across a wide range of domains:


Excited to continue working on this. See GitHub for a feature roadmap.
