Perplexity Pro lets you upload images and extract text from them using optical character recognition technology. This feature saves you from retyping printed documents, scanned notes, or screenshots that contain text. You can ask Perplexity to read the text in an image and use it as context for follow-up questions. This article explains how to upload images for OCR in Perplexity Pro and what to do when the extracted text is not accurate.
Key Takeaways: Extracting Text From Images in Perplexity Pro
- Image upload button in the query bar: Click the camera icon to select an image from your device or paste one directly.
- Focus mode selection: Use the “Web” or “Academic” focus for better OCR accuracy on documents with dense text.
- Pro Search toggle: Enable Pro Search to let Perplexity analyze the image more thoroughly and correct misread characters.
What Perplexity Pro Image Upload for OCR Does
Perplexity Pro can read text from image files using OCR technology built into its multimodal model. The feature works with JPEG, PNG, GIF, and WebP formats. You can upload a photo of a menu, a scanned contract, a whiteboard, or a screenshot of a PDF and ask Perplexity to summarize, translate, or extract specific data from it.
The OCR process happens inside the conversation thread. Perplexity reads the image and converts visible text into machine-readable characters. The extracted text then becomes part of the context for your current query. You can ask follow-up questions about the text without re-uploading the image.
Before using this feature, make sure you have an active Perplexity Pro subscription. Free accounts cannot upload images. You also need a stable internet connection because the image is sent to Perplexity servers for processing.
Steps to Upload an Image for OCR in Perplexity Pro
- Open Perplexity in your browser or app
Go to perplexity.ai and log in with your Pro account. On mobile, open the Perplexity app. - Click the camera icon in the query bar
The icon is located on the left side of the text input field. It looks like a small camera. Click it to open the file picker. - Select an image from your device
Choose a JPEG, PNG, GIF, or WebP file that contains text. You can also paste an image directly by pressing Ctrl+V on Windows or Cmd+V on Mac. - Type your OCR request
After the image appears in the query bar, type a clear instruction. For example: “Extract all text from this image” or “Read the text in this photo and list the names.” - Enable Pro Search for better accuracy
Toggle the Pro Search button (lightning bolt icon) to on. This gives Perplexity more compute power to analyze the image and correct common OCR errors. - Press Enter to send
Perplexity processes the image and returns the extracted text in the response. Review the output and ask follow-up questions if needed.
Using Focus Mode to Improve OCR Results
Perplexity offers focus modes that affect how the system interprets the image. For OCR tasks, the Web focus mode works well because it combines the extracted text with web search results to verify accuracy. The Academic focus mode is better for scientific documents or papers. The Writing focus mode is useful when you want to rewrite the extracted text into a different format, such as a summary or bullet points.
Common Issues When Using Perplexity Pro Image Upload for OCR
Perplexity Does Not Read Text From the Image
If Perplexity responds with a description of the image instead of the text, your prompt may be too vague. Be explicit: write “Read the text from this image” or “Extract all characters visible in the photo.” Also check that the image is not too small or blurry. Resize the image to at least 800 pixels on the longest side before uploading.
Extracted Text Contains Errors or Missing Characters
OCR accuracy depends on image quality and font type. Handwritten text, stylized fonts, and low-contrast images produce more errors. Enable Pro Search and select the Web focus mode to let Perplexity cross-reference the extracted text with similar documents online. You can also crop the image to remove background noise before uploading.
Image Upload Button Is Grayed Out
This happens when you are using a free Perplexity account. Only Pro subscribers can upload images. Go to Settings > Subscription to verify your plan. If you have an active subscription, log out and log back in to refresh the session.
Spaces Disappear After Logging in From a New Device
Perplexity Pro image upload is tied to your account, not a specific device. If you log in from a new device and the upload feature is missing, clear the browser cache or reinstall the app. Your uploaded images and conversations remain available in the thread history.
| Item | Free Account | Pro Account |
|---|---|---|
| Image upload | Not available | Available |
| OCR accuracy | N/A | High with Pro Search |
| Supported formats | N/A | JPEG, PNG, GIF, WebP |
| File size limit | N/A | 20 MB per image |
| Pro Search toggle | Not available | Available |
You can now upload images to Perplexity Pro and extract text using OCR technology. Start by clicking the camera icon in the query bar and typing a specific extraction request. For best results, enable Pro Search and use the Web focus mode. If the extracted text contains errors, crop the image to remove distractions and rephrase your prompt to be more explicit about what text you need.