avatarCornellius Yudha Wijaya

Summary

This article introduces four Visual Studio Code extensions that can enhance data processing activities for data scientists.

Abstract

The article "4 Visual Studio Code Extensions to Supercharge Your Data Processing" discusses the benefits of using Visual Studio Code (VS Code) as an Integrated Development Environment (IDE) for data scientists, who might typically use Jupyter Notebook for their data analysis tasks. The author highlights four VS Code extensions that can make the IDE more versatile and powerful for data processing activities. These extensions are Excel Viewer, PDF Preview, Generate Data, and SandDance. Excel Viewer allows users to preview Excel and CSV files, while PDF Preview enables users to read PDF files within VS Code. Generate Data provides dummy data for quick testing or data checking, and SandDance is a VS Code extension for visually exploring CSV data files. The article provides step-by-step instructions for installing and using each extension, along with screenshots to illustrate their functionality.

Bullet points

  • The article introduces four Visual Studio Code extensions that can enhance data processing activities for data scientists.
  • Excel Viewer is a simple VS Code extension that previews Excel and CSV files on the IDE, providing a way for data selection preview and sorting.
  • PDF Preview is a VS Code extension that allows users to read PDF files within the IDE.
  • Generate Data is a VS Code extension that provides dummy data for quick testing or data checking.
  • SandDance is a VS Code extension for visually exploring CSV data files, providing a visualization dashboard to explore data.
  • The article provides step-by-step instructions for installing and using each extension, along with screenshots to illustrate their functionality.

4 Visual Studio Code Extensions to Supercharge Your Data Processing

Improve your data activity with these extensions

Photo by Tim Mossholder on Unsplash

If you are not subscribed as a Medium Member, please consider subscribing through my referral.

As data scientists, we might be comfortable using Jupyter Notebook for everyday data analysis. However, sometimes we must use another IDE for the programming activity — for example, Visual Studio Code.

VS Code is one of the most used programming IDE that offers many extensions. These extensions make the IDE more versatile and powerful for data scientists.

This article will teach four VS Code extensions that supercharge your data processing activity. What are these extensions? Let’s get into it.

1. Excel Viewer

Excel Viewer is a simple VS Code extension that previews Excel and CSV files on the IDE. The extensions also provide a way for data selection preview and sorting, which is nice to have.

To install the extensions, you only need to go to the Extensions Marketplace in your VS Code and find the Excel Viewer, similar to the image below.

Image by Author

When the installation succeeds, we can use the extensions. Right-click the file in your excel or CSV file and select open with. Then choose the CSV Editor if the file is a CSV file similar to the image below.

Image by Author

In your VS Code, you would obtain the data preview.

Image by Author

You can scroll the table to see all the data you have. If you need a more specific data selection, press the columns and get the filter similar to the image below.

Image by Author

It’s a simple extension but helpful when we want to understand our data better.

2. PDF Preview

PDF Preview is an extension to read PDF files in the VS Code. It’s a simple extension but useful in many data processing applications, especially if it involves PDF files.

Try to search the PDF Preview in the extension marketplace, and you will find them easily.

Image by Author

After the installation succeeds, open any PDF file in your VS code. In my case, it would look like the image below.

Image by Author

3. Generate Data

marketplace.visualstudio.com is a VS Code extension that provides dummy data for processing data. It could be used for quick testing or data checking.

To install the extensions, search the Generate Data on the extension marketplace, and you will find them.

Image by Author

After installing the package, we could easily use this extension. Try to press F1 and press either “Generate Data” or “Generate Data — Multiple” if you need a lot of data.

Image by Author

Select the kind of data you want to generate, and you will get them instantly on your file.

Image by Author

4. SandDance

SandDance is a VS Code extension for visually exploring the CSV data file. It’s a great extension to understand our data with nice visualization quickly.

To install the extension, try to look for SandDance in the marketplace extension.

Image by Author

After installing the package, right-click the CSV files and choose View in SandDance.

Image by Author

After that, you will be presented with a visualization dashboard that we can use to explore the data.

Image by Author

Explore the extensions yourself, as many important functions could improve our data processing.

Conclusions

Visual Studio Code is an IDE used in many programming activities, including data processing activities. To supercharge the process, here are four recommended extensions to have:

  1. Excel Viewer
  2. Pdf Viewer
  3. Generate Data
  4. SandDance

I hope it helps!

Visit me on my LinkedIn or Twitter.

Data Science
Programming
Technology
Artificial Intelligence
Education
Recommended from ReadMedium