Skanda Vivek

Hello and Welcome!

Who I am, why I write, and why you might be interested

First of all, thank you so much for reading this post! While I’ve been writing on Medium for a couple of years now, I haven’t yet properly introduced myself here. I’m a data scientist, but my journey has not been typical. I started off by getting my PhD in physics and doing research, then worked my way up to being a professor at a small college, writing data science articles here on Medium and mentoring folks along the way. I think writing on Medium in top publications like Towards Data Science, Towards AI, Start It Up, etc. enabled me to transition smoothly to the data science industry. Over the past few years, I’ve learnt a lot on Medium: what content works well, what doesn’t, and how to be successful while having fun writing!

Oh, and I’m also a dad to twin boys and love spending quality time with them! Believe it or not, my interactions with my sons help me in writing blogs, communicating knowledge, and building larger audiences. This summer, we have been taking them swimming a lot, and we are seeing the fruits of their (and our) hard work. A couple of weeks ago, they were scared of getting in the water. Thanks to dedicated effort on their part and some positive feedback from us, they are finally comfortable with their pool noodles at the deep end! But the ride has not been completely smooth. We have endured our fair share of screaming: first because they were scared, and now because they don’t want to get out of the pool!

Consistency adds up, and efforts pay off in the long run, even if that doesn’t seem true in the moment. Sometimes the best you can do is to stay positive.

What Do I Write About?

Before diving into how I made it here, let me talk about what I write about and why you might be interested. Over the last year, I’ve become proficient in applying state-of-the-art natural language processing (NLP) and AI methods to industry use cases. My more popular articles cover transformers, open-source vs closed-source large language models, and geospatial data science.

If you are interested in how to apply the latest and greatest AI advancements to industry use cases, you are in the right place. I spend a lot of time researching these advancements from all sorts of sources: research papers, company announcements, social media, courses, etc. I then experiment with these tools and write accessible blogs for a wide range of folks. Non-technical folks can skim these articles, and technical folks can read through in detail. I also provide code snippets and GitHub links for those of you who are interested in replicating the results or applying them to your own use cases. Many folks have reached out on LinkedIn with questions about applying what I demonstrated to their own use cases. So feel free to reach out!

My Journey Through Blogs

As promised, I’ll talk about how I got here, chronicled through my blogs!

The first blog I wrote was in late November 2019, about research I was doing as a post-doc: modeling the impacts of large-scale cyber attacks on vehicles. The blog had a very small reach, but I got an email from The Startup asking whether I would like to publish it in their publication.

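At the time I knew nothing about The Startup, but I soon realized it was the biggest Medium publication at the time!

What if the next large-scale hack involved your vehicle instead of your security camera? (https://readmedium.com/what-if-the-next-large-scale-hack-involved-your-vehicle-instead-of-your-security-camera-45ba0895861d)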

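That single experience made me seriously consider blogging regularly. After this, I published my first blog in Towards Data Science, about traffic patterns:

Visualizing real-time traffic patterns using HERE traffic api (https://towardsdatascience.com/visualizing-real-time-traffic-patterns-using-here-traffic-api-5f61528d563)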

Next, I wrote a series of blogs on the “science of everyday phenomena”, which corresponded to a course I was teaching at the time and an open-source book I made for that course. I created a new publication, “Emergent Phenomena”, for these articles:

Emergent Phenomena (https://medium.com/emergent-phenomena)

While these articles on everyday science were not widely read, they gave me a routine and a consistent pace for writing and publishing. This was also during COVID, when the college shut down for quite a few months and I spent a lot of time writing.

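I was getting ~2–3k monthly views at this point, and mentoring beginners in data science, when I decided to transition to industry. I wrote a blog about this as well:

How I Transitioned from Academia To the Data Science Industry (https://towardsdatascience.com/how-i-transitioned-from-academia-to-the-data-science-industry-d5e00479fea1)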

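Soon after, I started experimenting with NLP and fine-tuning transformer-based models for custom applications. So I wrote a blog on this:

Fine-Tune Transformer Models For Question Answering On Custom Data (https://towardsdatascience.com/fine-tune-transformer-models-for-question-answering-on-custom-data-513eaac37a80)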

After this blog, my monthly views rose to around 8–10k. Since then, I have written multiple blog articles on NLP, AI, and industry applications:

When Should You Fine-Tune LLMs?

LLM Economics: ChatGPT vs Open-Source

How Do You Build A ChatGPT-Powered App?

Extractive vs Generative Q&A — Which is better for your business?

Fine-Tune Transformer Models For Question Answering On Custom Data

Unleashing the Power of Generative AI For Your Customers

Build Industry-Specific LLMs Using Retrieval Augmented Generation

After a few such articles on using Generative AI to solve real-world business problems, my monthly views rocketed to 30k+!

The Future — A Roadmap

I plan to write exclusively on AI and industry applications over the next year. This niche matters because so many folks are trying to adopt Generative AI (ChatGPT/open-source LLMs) for their business needs. There are great folks making fundamental innovations at OpenAI, Google, etc., writing research papers and communicating this research to the public. At the same time, folks at places like Hugging Face and AWS write brilliant tutorials on deploying state-of-the-art open-source LLMs on their platforms.

However, there is a lack of information on how to integrate these fundamental innovations into specific industry settings. For example, two things I found really lacking were a detailed price comparison between open-source and closed-source LLMs, and a discussion of which sort of LLM to use for which case. This is where I plan to focus my content (and have done so over the last ~10 blogs).

So stay tuned as I explore the nitty-gritty of AI, deploying models at scale, cost vs quality, and lots more, packaged in a way that is tailored to solve business needs. I also created a YouTube channel a while ago, where I’m now going to be more active:

Data Science In Everyday Life (https://www.youtube.com/channel/UCqTQFBL17FbF0imuzZOUP_A)

In the next phase, I’m going to create a landing page once I have a better idea of the curated content I can provide (e.g. courses, or a weekly newsletter on the latest and greatest AI advancements relevant for businesses). I haven’t nailed this part down yet, but comments are really appreciated!

I look forward to having you all join me on my journey of learning, creating, and sharing knowledge!

If you like this post, follow me — I write on topics related to applying state-of-the-art NLP in real-world applications and, more generally, on the intersections between data and society.

Feel free to connect with me on LinkedIn: https://www.linkedin.com/in/skanda-vivek-01619311b/

If you are not yet a Medium member and want to support writers like me, feel free to sign up through my referral link: https://skanda-vivek.medium.com/membership
