Spark Issue: RuntimeException: Unsupported Literal Type Class

Jan 15, 2022

Symptom

java.lang.RuntimeException: Unsupported literal type class java.util.ArrayList [1]

Possible Causes

This happens in PySpark when a Python list is provide where a scalar is required. Assuming id0 is an integer column in the DataFrame df, the following code throws the above error.

v = [1, 2, 3 …

PySpark Issue: Java Gateway Process Exited Before Sending the Driver Its Port Number

Oct 10, 2021

I countered the issue when using PySpark locally (the issue can happen to a cluster as well). It turned out to be caused by a misconfiguration of the environment variable JAVA_HOME in Docker.

References

PySpark: Exception: Java gateway process exited before sending the driver its port number

Count Number of Fields in Each Line

Jun 13, 2016

Sometimes, a structured text file might be malformatted. A simple way to verify it is to count the number of fields in each line.

Using awk

You can count the number of fields in each line using the following awk command. Unfortunately, awk does not take escaped characters into consideration …

Quickly Create a Scala Project Using Gradle in Intellij IDEA

Jan 26, 2019

Easy Way

Create a directory (e.g., demo_proj) for your project.
Run gradle init --type scala-library in terminal in the above directory.
Import the directory as a Gradle project in IntelliJ IDEA. Alternatively, you can add apply plugin: 'idea' into build.gradle and then run the command ./gradlew openIdea to …

Visual Studio Code for Python

Mar 30, 2019

Extensions

Please refer to Useful Visual Studio Code Extensions .

Set Python Environment for Visual Studio Code Server

File -> Preference -> Settings
Click on Workspace.
Search for Python Path.
Change Python Path to the one you want to use.

Set Python Path

Debug a Python Project

Visual Studio Live Share

What …

Install Python Packages Behind Firewall

Jul 09, 2014

It is recommended that you use pip to install Python packages.

If you don't already know the proxy in use (in your company), read the post Find out Proxy in Use to figure it out.

Set proxy environment variables.

set http_proxy=http://user:password@proxy_ip:port
set https_proxy=https://user …

← Older Newer →

Ben Chuanlong Du's Blog

And let it direct your passion with reason.