Ben Chuanlong Du's Blog

And let it direct your passion with reason.

Types of Joins of Spark DataFrames

Comments

  1. Always pass a list of columns to the `on` parameter, even if there is only one join column.

  2. A None in a pandas DataFrame is converted to NaN, not null, when the DataFrame is converted to a Spark DataFrame!

  3. Spark supports the following join types (names on the same line are aliases):

    • inner (default)
    • cross
    • outer, full, fullouter, full_outer
    • left, leftouter, left_outer
    • right, rightouter, right_outer
    • semi, leftsemi, left_semi
    • anti, leftanti, left_anti
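
The comments above can be illustrated with a minimal PySpark sketch. The helper names `join_on` and `demo`, the column names, and the sample rows are all made up for illustration and are not part of PySpark; only `DataFrame.join(other, on=..., how=...)` is the real API being exercised. Running `demo()` assumes a local Spark installation.

```python
# Sketch only: `join_on` and `demo` are hypothetical helpers, not PySpark APIs.

def join_on(left, right, on, how="inner"):
    """Join two Spark DataFrames, always passing `on` as a list of column names."""
    # Wrap a single column name in a list, following comment 1 above.
    columns = [on] if isinstance(on, str) else list(on)
    return left.join(right, on=columns, how=how)


def demo():
    """Build two tiny DataFrames and show several join types (requires pyspark)."""
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.master("local[1]").appName("join-demo").getOrCreate()
    # Illustrative data: id 1 exists only on the left, id 3 only on the right.
    left = spark.createDataFrame([(1, "a"), (2, "b")], ["id", "x"])
    right = spark.createDataFrame([(2, "B"), (3, "C")], ["id", "y"])

    for how in ["inner", "left_outer", "full_outer", "left_semi", "left_anti"]:
        print(how)
        join_on(left, right, "id", how=how).show()

    spark.stop()
```

Because the join columns are given by name in a list, each of them appears only once in the result, instead of once per input DataFrame as happens when joining on a column expression.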

Debug Python Project in Visual Studio Code

Ways to Open a Command Palette

  1. Use the menu View -> Command Palette....
  2. Use the shortcut Shift + Command + P on macOS (Ctrl + Shift + P on Windows and Linux).

Command Palette

The Command Palette lets you search for commands by name, so you do not have to remember where they sit in the menus.

Run Tests or a Python File

  1. Open the Command Palette.
  2. Search for Python: Run in the …