It depends on how the movies are sent to the screen. Does the game use 16-bit or 24-bit mode? In 16-bit mode you can simply render the text as regular sprites on top of the frame. In 24-bit mode the GPU can't draw sprites into the framebuffer, so you need to write your own blit functions and override the slice upload (MDEC frames are subdivided into 16x? slices) to stamp the font pixels onto each frame.
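To give an idea of what the 24-bit case involves, here is a rough sketch of stamping a glyph onto one decoded slice in RAM before it gets uploaded. It is not taken from any game. `stamp_glyph_24`, the 1-byte-per-pixel glyph format, and the R, G, B byte order are all assumptions made for illustration, and the slice height is passed in as a parameter.

```c
#include <stdint.h>

#define SLICE_W 16  /* slice width in pixels, as mentioned above */

/* Hypothetical helper: stamp one glyph onto a 24-bit slice held in RAM.
 *   slice    packed 3 bytes per pixel, SLICE_W * slice_h pixels, row-major
 *            (R, G, B byte order is assumed here)
 *   slice_x  frame X coordinate of the slice's left edge
 *   glyph    gw * gh bytes, non-zero means opaque (assumed format)
 *   gx, gy   frame position of the glyph's top-left corner
 * Only the glyph columns that fall inside this slice are written, so the
 * same call can be repeated for every slice the glyph overlaps. */
static void stamp_glyph_24(uint8_t *slice, int slice_x, int slice_h,
                           const uint8_t *glyph, int gw, int gh,
                           int gx, int gy,
                           uint8_t r, uint8_t g, uint8_t b)
{
    for (int y = 0; y < gh; y++) {
        int fy = gy + y;                    /* row in the frame/slice */
        if (fy < 0 || fy >= slice_h)
            continue;
        for (int x = 0; x < gw; x++) {
            int sx = gx + x - slice_x;      /* column inside this slice */
            if (sx < 0 || sx >= SLICE_W)
                continue;                   /* belongs to another slice */
            if (!glyph[y * gw + x])
                continue;                   /* transparent glyph pixel */
            uint8_t *p = slice + (fy * SLICE_W + sx) * 3;
            p[0] = r;
            p[1] = g;
            p[2] = b;
        }
    }
}
```

The idea would be to call this from the overridden upload routine, once per slice, right before the slice is copied to VRAM. Text that straddles a slice boundary then gets drawn in two passes, one per slice.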
Here's some C code I had for FF7i subtitles.