You save a spreadsheet as a CSV file, but when you open it, the text is corrupted with strange symbols. This happens because Excel uses a default text encoding that does not support all characters. This article explains how to save and open CSV files with the correct encoding to preserve your data.
Key Takeaways: Fixing CSV File Encoding in Excel
- File > Save As > CSV UTF-8: This is the most reliable format for saving files with international characters and emojis.
- Import via Data > Get Data > From Text/CSV: Use the Power Query editor to manually select the correct file encoding when opening a corrupted CSV.
- Windows Notepad Encoding Conversion: Open a garbled CSV in Notepad and use File > Save As to change the encoding to UTF-8.
Why CSV Files Lose Their Character Encoding
A CSV file is a plain text file. It contains data separated by commas, but it has no information about fonts, colors, or formulas. The text itself must be stored using a specific character encoding standard, which is a map between numbers and letters.
Excel for Windows has historically used an encoding called ANSI, which is based on your system’s regional language setting. This encoding cannot display characters from other languages, such as Chinese, Japanese, or special symbols like emojis. When you save a file containing these characters using the standard “CSV (Comma delimited)” option, Excel cannot write them correctly. It replaces the unsupported characters with garbled symbols like é or ?.
The modern standard for text files is UTF-8, which can represent every character in the Unicode standard. To avoid corruption, you must explicitly tell Excel to use UTF-8 encoding when saving or opening a CSV file.
Steps to Save and Open CSV Files With Correct Encoding
Method 1: Save a New CSV File as UTF-8 in Excel
Use this method when you are creating a CSV from an Excel workbook and need to preserve all characters.
- Prepare your data in Excel
Ensure your worksheet contains the data you want to export. Save your work as a regular .xlsx file first as a backup. - Go to File > Save As
Click the File tab in the ribbon, then select Save As. Choose the folder where you want to save the CSV file. - Choose the CSV UTF-8 format
In the Save As dialog box, click the “Save as type” dropdown menu. Scroll down and select “CSV UTF-8 (Comma delimited) (*.csv)”. This is the critical step. - Name and save the file
Enter a file name and click Save. Excel will show warnings about features not being saved in CSV format; click OK to proceed. Your file is now saved with UTF-8 encoding.
Method 2: Open a Garbled CSV File With Correct Encoding
Use this method when you receive a corrupted CSV file and need to open it correctly in Excel.
- Launch the import wizard
In Excel, go to the Data tab. In the Get & Transform Data group, click From Text/CSV. Do not double-click the file in Windows Explorer. - Select the corrupted file
Navigate to and select the CSV file with garbled characters, then click Import. The Power Query editor window will open. - Change the file origin
In the preview pane at the bottom of the editor, look for a dropdown labeled “File Origin”. Click it and try different encodings, such as UTF-8 or Windows-1252. The preview will update instantly. - Load the data
Once the text in the preview looks correct, click the Load button. Excel will import the data with the chosen encoding into a new worksheet.
Method 3: Use Notepad to Convert Encoding
This is a universal fix that does not require Excel. Right-click the garbled CSV file and select Open with > Notepad. The text will likely still appear corrupted.
- Open the Save As dialog
In Notepad, click File > Save As. - Set the encoding
At the bottom of the Save As dialog, find the “Encoding” dropdown. Change it from ANSI to UTF-8. - Save the file
You can use the same file name to overwrite, or save a new copy. Click Save. The file is now converted and can be opened correctly in Excel.
If Your CSV File Is Still Displaying Incorrect Characters
Excel Shows Question Marks or Boxes for Certain Characters
This usually means the font in Excel does not support the characters. After importing the data correctly, select the cells with the problem. Go to the Home tab and change the Font to a universal type like Arial or Segoe UI. These fonts have broad Unicode support.
The Data Separator Is Wrong After Saving as UTF-8 CSV
Some European systems use a semicolon as a list separator instead of a comma. When you open a UTF-8 CSV, Excel might put all data into one column. To fix this, open the file using the Data > From Text/CSV method. In the Power Query editor, change the “Delimiter” setting from Comma to Semicolon before loading.
Special Characters Are Lost When Re-opening a Saved CSV
If you save as UTF-8 CSV, open it, edit it, and save again, Excel might revert to the old ANSI encoding. Always use File > Save As and manually select “CSV UTF-8” every time you save. Do not just click the standard Save icon.
CSV Save Options in Excel: A Comparison
| Item | CSV (Comma delimited) | CSV UTF-8 (Comma delimited) |
|---|---|---|
| Default Encoding | ANSI (Windows-1252) | UTF-8 |
| Character Support | Basic Latin alphabet for your region | Full Unicode, including emojis and all global scripts |
| Best For | Simple English data with no accents | International data, multilingual text, special symbols |
| Opening Method | Double-click in File Explorer | Use Data > From Text/CSV for reliable results |
| System Compatibility | Older systems and region-specific software | Modern web applications, databases, and cross-platform systems |
You can now save and open CSV files without losing special characters. Always select the CSV UTF-8 option in the Save As dialog for any data beyond basic English. For opening files, get into the habit of using the Data tab’s import tool instead of double-clicking. As an advanced tip, you can set the system locale in Windows to force a different default encoding, but using UTF-8 is the more reliable modern standard.