MovieAgent quantitative results on ViStoryBench #12

@ViStoryBench

Hello, and thank you for your impressive work on MovieAgent! We are the authors of ViStoryBench, a comprehensive benchmark suite that we recently developed to evaluate story visualization models across diverse narrative structures, visual styles, and character settings. The benchmark features 80 curated stories and 344 characters, and introduces 12 automated metrics that systematically assess key aspects such as character consistency, style similarity, prompt adherence, and aesthetic quality.

ViStoryBench Evaluation

Image

We evaluated more than 18 methods and are delighted to report that MovieAgent performed excellently, ranking among the top contenders on several critical metrics.

Notably, MovieAgent (SD3) achieved:

An Alignment Score of 76.1, showcasing its strong ability to adhere to complex textual prompts.

An Inception Score of 15.02, indicating high diversity in generation.

An Aesthetics Score of 5.33, confirming the high visual quality of the output.

| Method | CSD (Style) Cross | CSD (Style) Self | CIDS (Character) Cross | CIDS (Character) Self | Alignment Score | OCCM Score | Inception Score | Aesthetics Score | Copy-Paste Degree |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| MovieAgent (ROICtrl) | 20.0 | 49.3 | 30.5 | 51.4 | 36.5 | 86.7 | 11.63 | 4.65 | 0.33 |
| MovieAgent (SD3) | 30.9 | 48.3 | 34.9 | 52.2 | 76.1 | 87.5 | 15.02 | 5.33 | -0.40 |

These results, which you can find in Table 2 of our paper, strongly validate MovieAgent's advanced capabilities. We believe our benchmark effectively highlights the technical strengths of your model.

If you find our benchmark helpful, we would be honored if ViStoryBench could be considered for citation in your work. For any questions or further information, please contact us at vistorybench@126.com.

The fine-grained results are shown below.

Image
