PySpark, MongoDB and Bokeh

Total study time: 24 hours

How can you quickly analyze massive datasets and turn them into clear, visual insights? In this one-on-one course, you’ll explore the power of PySpark, MongoDB, and Bokeh to transform data and bring it to life. With personal guidance and hands-on projects supported by online modules, you’ll gain practical skills you can apply right away—taking your data-driven work to the next level.

What are PySpark, MongoDB and Bokeh?

PySpark is a powerful tool for working with big data. Built on Apache Spark, it enables fast processing of huge datasets, making it perfect for data analysis and machine learning where speed and scale matter. With PySpark, you can clean, transform, and analyze large volumes of data efficiently.

MongoDB is a flexible, scalable NoSQL database. Unlike traditional databases, it stores data in JSON-like documents, which makes it ideal for handling unstructured or semi-structured information—especially geospatial data.

Bokeh is a modern tool for interactive data visualization. It lets you build dynamic charts and dashboards you can view and share online. It’s especially useful when working with maps or spatial data, making your insights easier to understand and explore.

What will you learn in this blended learning course?

In this course, you’ll build real-world skills in data processing and visualization using PySpark, MongoDB, and Bokeh.

You’ll start by building data pipelines in PySpark. Using DataFrames, you’ll learn how to clean, transform, and prepare big data for analysis. Then, you’ll apply machine learning techniques using Spark’s MLlib—helping you uncover patterns in complex geospatial datasets.

Next, you’ll work in Jupyter Notebook, combining PySpark, MongoDB, and Bokeh in a single workflow. You’ll write code, explore data, and create visualizations—all in one place.

With MongoDB, you’ll learn how to efficiently manage unstructured data, especially when dealing with fast-growing geospatial datasets.

Then, using Bokeh, you’ll design and build interactive dashboards that clearly communicate your insights. You’ll also learn how to set up a lightweight server to share your visualizations with others.

Finally, you’ll cover the basics of geo-mapping, so you can clearly display the spatial patterns in your data.

Why choose this PySpark, MongoDB and Bokeh course?

Blended learning gives you the best of both worlds—live interaction and flexible self-paced study—so you can build job-ready skills at your own pace.

We start with a live session where you’ll dive into real-world datasets. Guided by data experts, you’ll process large-scale data using PySpark, organize flexible geospatial records in MongoDB, and begin creating interactive dashboards with Bokeh.

Next, our self-paced modules let you expand your knowledge step by step. You’ll explore topics like data pipelines, NoSQL databases, and machine learning. Along the way, you’ll clean and transform data, structure it in MongoDB, and visualize it using Bokeh.

Then, in a second live session, you’ll apply your new skills to realistic challenges. You’ll get feedback from instructors, troubleshoot issues, and refine your workflow for better results.

One of the highlights of this course is its practical focus. You’ll build real, usable outputs—like predictive models and interactive dashboards—that can be directly applied in your job or research.

By the end, you won’t just understand how these tools work—you’ll know how to use them to drive smarter, faster, and more informed decisions.

Sign up for this
Blended Learning

    Price: €395 (excl. VAT)



    Start:
     2-hour online session


    Self-study:
     Review course materials


    End:
     1-hour online session



    You’ll receive 1-on-1 guidance. After signing up, our course coordinator will contact you to schedule your first session.

    Learning Outcomes

    After completing this course, you’ll be able to:

    • Analyze large datasets using PySpark for machine learning and data exploration
    • Work efficiently with MongoDB to manage NoSQL databases and geospatial data
    • Build and share interactive dashboards with Bokeh
    • Use Jupyter Notebooks to document and present your data analysis
    • Apply advanced data techniques to support smarter, data-driven decisions

    More Information?

    Do you have questions about the course content? Not sure if the course aligns with your learning objectives? Or would you prefer a private session or in-company training? We’re happy to assist—feel free to get in touch.

    Frequently Asked Questions

    We aim to make our courses accessible to as many people as possible. If the course fee is a concern, instead of registering directly, you can indicate on the registration form that you’d like to receive a quote tailored to your needs. In many cases, we can offer flexible solutions. For instance, we can adjust the course content for large groups, shorten the course based on existing knowledge, or offer daily rates to suit your requirements.

    Yes, you can reach out to the instructor with questions for up to 2 weeks after the course. Since the instructor is likely teaching other courses, we recommend emailing your questions to info@geo-ict.com. We’ll forward your inquiry to the instructor, and you’ll receive a response within 24 hours. After the 2-week period, we recommend using our Personalized Online Support for continued assistance.

     

    Yes, we offer on-site training regularly throughout the Netherlands. Our instructor will bring laptops for the participants, and all you need to do is arrange a suitable room at your location.

    Please send your requirements to info@geo-ict.com, and we will provide a customized quote, which will include travel and accommodation costs. Once the quote is confirmed, our course coordinator will reach out to schedule the training days.

    After each course, participants receive a link to our evaluation portal where you can share your feedback on what you liked and didn’t like. We strive to provide a great experience for all our participants, but if you have a complaint, please click on ‘Complaints Procedure’ in the portal. This document will guide you through the steps to take. Geo-ICT Training Center, Netherlands, is a member of the Dutch Council for Training and Education (NRTO), ensuring a fair and transparent process.

    Courses are typically scheduled according to the Dutch time zone, with sessions running from 9:00 AM to 12:00 PM and 1:00 PM to 4:00 PM. For participants in different time zones, we adjust the course times in consultation with you.