MM Dataset Creation Pipeline