hadoop - What is best way to multiple dataframe? -
i have 4 data frame. df1 name,id, age
df2 name,id,group
df3 name,id,city
df4 name,id, ctry
i join 4 dataframe macth of same 'name' , 'id'. e.g df
name, id,age,group,city,ctry
try:
list(df1, df2, df3, df4).reduce(_ join (_, seq("name", "id")))
Comments
Post a Comment