Excel CSV Saved With Garbled Characters: How to Fix the File Encoding
🔍 WiseChecker

Excel CSV Saved With Garbled Characters: How to Fix the File Encoding

You save a spreadsheet as a CSV file, but when you open it, the text is corrupted with strange symbols. This happens because Excel uses a default text encoding that does not support all characters. This article explains how to save and open CSV files with the correct encoding to preserve your data.

Key Takeaways: Fixing CSV File Encoding in Excel

  • File > Save As > CSV UTF-8: This is the most reliable format for saving files with international characters and emojis.
  • Import via Data > Get Data > From Text/CSV: Use the Power Query editor to manually select the correct file encoding when opening a corrupted CSV.
  • Windows Notepad Encoding Conversion: Open a garbled CSV in Notepad and use File > Save As to change the encoding to UTF-8.

Why CSV Files Lose Their Character Encoding

A CSV file is a plain text file. It contains data separated by commas, but it has no information about fonts, colors, or formulas. The text itself must be stored using a specific character encoding standard, which is a map between numbers and letters.

Excel for Windows has historically used an encoding called ANSI, which is based on your system’s regional language setting. This encoding cannot display characters from other languages, such as Chinese, Japanese, or special symbols like emojis. When you save a file containing these characters using the standard “CSV (Comma delimited)” option, Excel cannot write them correctly. It replaces the unsupported characters with garbled symbols like é or ?.

The modern standard for text files is UTF-8, which can represent every character in the Unicode standard. To avoid corruption, you must explicitly tell Excel to use UTF-8 encoding when saving or opening a CSV file.

Steps to Save and Open CSV Files With Correct Encoding

Method 1: Save a New CSV File as UTF-8 in Excel

Use this method when you are creating a CSV from an Excel workbook and need to preserve all characters.

  1. Prepare your data in Excel
    Ensure your worksheet contains the data you want to export. Save your work as a regular .xlsx file first as a backup.
  2. Go to File > Save As
    Click the File tab in the ribbon, then select Save As. Choose the folder where you want to save the CSV file.
  3. Choose the CSV UTF-8 format
    In the Save As dialog box, click the “Save as type” dropdown menu. Scroll down and select “CSV UTF-8 (Comma delimited) (*.csv)”. This is the critical step.
  4. Name and save the file
    Enter a file name and click Save. Excel will show warnings about features not being saved in CSV format; click OK to proceed. Your file is now saved with UTF-8 encoding.

Method 2: Open a Garbled CSV File With Correct Encoding

Use this method when you receive a corrupted CSV file and need to open it correctly in Excel.

  1. Launch the import wizard
    In Excel, go to the Data tab. In the Get & Transform Data group, click From Text/CSV. Do not double-click the file in Windows Explorer.
  2. Select the corrupted file
    Navigate to and select the CSV file with garbled characters, then click Import. The Power Query editor window will open.
  3. Change the file origin
    In the preview pane at the bottom of the editor, look for a dropdown labeled “File Origin”. Click it and try different encodings, such as UTF-8 or Windows-1252. The preview will update instantly.
  4. Load the data
    Once the text in the preview looks correct, click the Load button. Excel will import the data with the chosen encoding into a new worksheet.

Method 3: Use Notepad to Convert Encoding

This is a universal fix that does not require Excel. Right-click the garbled CSV file and select Open with > Notepad. The text will likely still appear corrupted.

  1. Open the Save As dialog
    In Notepad, click File > Save As.
  2. Set the encoding
    At the bottom of the Save As dialog, find the “Encoding” dropdown. Change it from ANSI to UTF-8.
  3. Save the file
    You can use the same file name to overwrite, or save a new copy. Click Save. The file is now converted and can be opened correctly in Excel.

If Your CSV File Is Still Displaying Incorrect Characters

Excel Shows Question Marks or Boxes for Certain Characters

This usually means the font in Excel does not support the characters. After importing the data correctly, select the cells with the problem. Go to the Home tab and change the Font to a universal type like Arial or Segoe UI. These fonts have broad Unicode support.

The Data Separator Is Wrong After Saving as UTF-8 CSV

Some European systems use a semicolon as a list separator instead of a comma. When you open a UTF-8 CSV, Excel might put all data into one column. To fix this, open the file using the Data > From Text/CSV method. In the Power Query editor, change the “Delimiter” setting from Comma to Semicolon before loading.

Special Characters Are Lost When Re-opening a Saved CSV

If you save as UTF-8 CSV, open it, edit it, and save again, Excel might revert to the old ANSI encoding. Always use File > Save As and manually select “CSV UTF-8” every time you save. Do not just click the standard Save icon.

CSV Save Options in Excel: A Comparison

Item CSV (Comma delimited) CSV UTF-8 (Comma delimited)
Default Encoding ANSI (Windows-1252) UTF-8
Character Support Basic Latin alphabet for your region Full Unicode, including emojis and all global scripts
Best For Simple English data with no accents International data, multilingual text, special symbols
Opening Method Double-click in File Explorer Use Data > From Text/CSV for reliable results
System Compatibility Older systems and region-specific software Modern web applications, databases, and cross-platform systems

You can now save and open CSV files without losing special characters. Always select the CSV UTF-8 option in the Save As dialog for any data beyond basic English. For opening files, get into the habit of using the Data tab’s import tool instead of double-clicking. As an advanced tip, you can set the system locale in Windows to force a different default encoding, but using UTF-8 is the more reliable modern standard.