Installing Apache Beam Python

Author: Al-mamun Sarkar Date: 2020-09-24 14:39:49

In this lesson, I will show you how to install Apache Beam in the python virtual environment. We will create a python virtual environment and install Apache Beam on that virtual env. Python has to be installed on your computer. If your operating system is Linux or mac then python is already installed. For windows download python form python.org and install it.

 

Check Python Version:

python --version

Installing PIP:

pip install --upgrade pip

Install Python virturalenv package:

pip install --upgrade virtualenv

Create a virtual environment:

virtualenv beam

Activate new virtualenv:

source beam/bin/activate

Install Python Apache Beam Package:

pip install apache-beam

 

Run WordCount Example Beam Pipeline:

python -m apache_beam.examples.wordcount --input inputfile --output counts

Replace inputfile by your file name. Such as test.txt