Ben Chuanlong Du's Blog

It is never too late to learn.

Spark Issue: AnalysisException: Found Duplicated Columns

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Symptoms

pyspark.sql.utils.AnalysisException: Found duplicate column(s) when inserting into ...

Possible Causes

As the error message says, there are duplicated columns in your Spark SQL code.

Possible Solutions

Fix the duplicated columns issues in your Spark SQL. For example, you can remove duplicated columns from your code or you can use different column names.

Comments