I was wondering if anyone knew inquisit is able to present a video and present text stimuli on top of that video at a certain time....
No. Displaying a video entails constant re-drawing, thus a superimposed <text> would vanish instantly. You must not use <text> consequently, but superimpose another <video>.