hadoop - What is the best way to join multiple DataFrames?


I have four DataFrames:

df1: name, id, age

df2: name, id, group

df3: name, id, city

df4: name, id, ctry

I want to join all four on matching 'name' and 'id', so that the resulting df has the columns:

name, id, age, group, city, ctry

Try:

List(df1, df2, df3, df4).reduce(_.join(_, Seq("name", "id")))
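
To make the one-liner above easier to try out, here is a minimal sketch that wraps it in a helper and runs it on tiny sample data. The object name JoinMany, the helper joinAll, the local SparkSession settings and the sample rows are all assumptions made for illustration; only the column names come from the question.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}

object JoinMany {
  // Hypothetical helper: folds the DataFrames pairwise, joining each one
  // onto the accumulated result using the shared key columns.
  // join(right, usingColumns) performs an inner join and keeps one copy of each key.
  def joinAll(dfs: Seq[DataFrame], keys: Seq[String]): DataFrame =
    dfs.reduce((left, right) => left.join(right, keys))

  def main(args: Array[String]): Unit = {
    // Local session for experimentation only (assumed settings).
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("join-many")
      .getOrCreate()
    import spark.implicits._

    // Made-up sample rows; column names follow the question.
    val df1 = Seq(("ann", 1, 30), ("bob", 2, 41)).toDF("name", "id", "age")
    val df2 = Seq(("ann", 1, "g1"), ("bob", 2, "g2")).toDF("name", "id", "group")
    val df3 = Seq(("ann", 1, "Oslo"), ("bob", 2, "Lima")).toDF("name", "id", "city")
    val df4 = Seq(("ann", 1, "NO"), ("bob", 2, "PE")).toDF("name", "id", "ctry")

    val joined = joinAll(Seq(df1, df2, df3, df4), Seq("name", "id"))
    joined.show()

    spark.stop()
  }
}
```

Because this is an inner join, a (name, id) pair that is missing from any one of the DataFrames drops out of the result. Note also that reduce throws on an empty list, so the helper assumes at least one DataFrame is passed in.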
