Hello, I want to add a new String column to the dataframe based on an existing column values:
from pyspark.sql.functions import lit df.withColumn('strReplaced', lit(df.str.replace("a", "b").replace("c", "d"))) So basically I want to add a new column named "strReplaced", that is the same as the "str" column, just with character "a" replaced with "b" and "c" replaced with "d". When I try the code above I get following error: Traceback (most recent call last): File "<stdin>", line 1, in <module> AttributeError: 'Column' object has no attribute 'replace' So in fact I need somehow to get the value of the column df.str in order to call replace on it. Any ideas how to do this? -- Viktor ARDELEAN *P* Don't print this email, unless it's really necessary. Take care of the environment.