I’ve had some pretty remarkable results pasting lecture transcripts from youtube into gpt4 and getting well formatted/relevant markdown summaries from meandering and mis-transcribed content! Needs chunking up but surprisingly effective. It can even generate youtube urls with the right timestamps if you ask it nicely
It's less configurable than what you're describing, but I've found this useful in at least determining if a given video has the content I'm looking for: https://www.summarize.tech/