Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Actually I'm looking to take GPT-4 output and create file formats like keynote presentations, or pptx. Is that currently possible with some tools?


I would recommend creating a simplified JSON schema for the slides (say, presentation is an array of slides, each slide has a title, body, optional image, optional diagram, each diagram is one of pie, table, ... Then use a library to generate the pptx file from the content generated.


Library? What library?

It seems to me that a Transformer should excel at Transforming, say, text into pptx or pdf or HTML with CSS etc.

Why don't they train it on that? So I don't have to sit there with manually written libraries. It can easily transform HTML to XML or text bullet points so why not the other formats?


I don't think the name "Transformer" is meant in the sense of "transforming between file formats".

My intuition is that LLMs tend to be good at things human brains are good at (e.g. reasoning), and bad at things human brains are bad at (e.g. math, writing pptx binary files from scratch, ...).

Eventually, we might get LLMs that can open PowerPoint and quickly design the whole presentation using a virtual mouse and keyboard but we're not there yet.


It’s just XML They can produce HTML and transform python into php etc.

So why not? It’s easy for them no?


apparently pandoc also supports pptx

so you can tell GPT4 to output markdown, then use pandoc to convert that markdown to pptx or pdf.





Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: