I am jumping into custom agents for the first time and struggling to get the output I’m looking for. Ideally the agent will read a new ticket, identify any image or pdf attachments (inline or attached) and put the content of those images in as an internal note.
Right now, I can get it to paste a lot of the metadata or what it “assumes” the content is but not the actual values. For example if the screenshot has a list of 5 emails + IDs I just want the plaintext version in an internal note.
My current instructions read like
- When a ticket is created, look for any images (e.g., .png, .jpeg, .jpg) or PDFs. These files could be attachments but might also be in-line and there could be multiple images.
- If an image or PDF is found, paste all of the exact text output captured. Do not alter, summarise, or paraphrase the extracted text. Repeat this for all images or PDFs found
- Add the tag image_read to the ticket
There was an interesting demo of this concept at the showcase yesterday, not sure if I’m missing something or I just need some tweaking on the instructions.
