avatarCarmen Micsa, MA in English, podcaster

Free AI web copilot to create summaries, insights and extended knowledge, download it at here

5775

Abstract

e other hand, the <b>GPT-4 Turbo Vision</b> <b>preview </b>impresses with a vast context window of 128,000 tokens.</p><p id="dcf7"><i>Note: For a more in-depth understanding of tokens, take a moment to visit this <a href="https://help.openai.com/en/articles/4936856-what-are-tokens-and-how-to-count-them">link</a>.</i></p><h1 id="1c53">Tested Use-Cases using Gemini:</h1><h2 id="1f18">1] Image Interpretation</h2><p id="8a88">To test the image interpretation capabilities of Gemini I loaded a few images into Bard and posed questions that were based on Basic Object Recognition, Contextual Understanding, chart analysis, etc.</p><p id="257e">I tested basic object recognition by asking, “Can you identify the main subject in the picture?” and the model correctly identified the main subject.</p><figure id="5371"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*lnqXvKWNDxUu-Q1Mek0fYw.png"><figcaption>Screenshot from Bard’s console</figcaption></figure><p id="5278">Then I uploaded a bar graph depicting cost analysis for different AI models and asked the model to “Analyse the graph.”</p><p id="c087">While it correctly identified the models listed, it struggled to interpret the information and provided an incorrect explanation.</p><figure id="dbdd"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*n6ZGgZpXtvhnwn2kZyzBVg.png"><figcaption>Screenshot from Bard’s console</figcaption></figure><p id="b832">In contrast, GPT-4 provided a thorough explanation of the graph, pointing out and explaining every crucial detail. Therefore, GPT-4 excels in interpreting images.</p><figure id="52f9"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*Ge4LACOXW_Z51fAOQjMaZA.png"><figcaption>Response from GPT-4</figcaption></figure><h2 id="1bf1">2] Knowledge of Finance and Investment</h2><p id="168d">To assess Gemini’s understanding of finance and investment, I asked the following question: “What are the important questions to ask before putting money into financial products?”</p><p id="cf80">The model provided quite long and yet impressive responses.</p><figure id="277f"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*fu0bg5jEPcJACPZmgMRhJQ.png"><figcaption>Screenshot from Bard’s console</figcaption></figure><h2 id="0170">3] Access to Real-Time Data</h2><p id="31ec">Another super cool feature of Gemini is access to real-time information. It has been empowered to pull real-time data from other Google applications including Docs, Maps, Lens, Flights, Hotels, YouTube, and more. Also, its services are not limited to their knowledge to any date.</p><p id="763f">To test this feature, I asked:</p><p id="cd30">“Explain the training approaches used for the GPT-4 and Gemini models.”</p><figure id="d216"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*WrjzFaZvEtpbDPmfmMtXKQ.png"><figcaption>Screenshot from Bard’s console</figcaption></figure><p id="75fc">After thoroughly reviewing the responses, I found that the information provided was almost up to date; however, verifying the answers provided by AI models is crucial.</p><h2 id="1d76">4] Checking Math Ability</h2><p id="5840">Basic math, advanced math, algebraic, and temporal understanding questions were posed. Surprisingly, Gemini failed in basic math, advanced math, and temporal comprehension.</p><figure id="e4dd"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*01GO2XjW2fREqNKiDg7Now.png"><figcaption><i>The correct answer to the above question is 6.</i></figcaption></figure><p id="5dc5">When a bit more complicated problem was proposed as:</p><p id="dd2a"><b>Question</b>: “A debate club consists of 6 girls and 4 boys. A team of 4 members is said to be selected from this club including the selection of a captain (from among these 4 members) for the team. If the team has to be included at most one boy, then find the number of ways of selecting the team.”</p><p id="49a6">Sadly, Gemini could not solve the above problem.</p><figure id="c9f8"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*ueTEhoqMqKUdzaNhfbOLog.png"><figcaption>Screenshot from Bard’s console</figcaption></figure><p id="a41a">However, the GPT-4 model gave the right answer with explanations. Here is the response from the GPT-4 model.</p><figure id="2181"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*N7MeE4riVY7kcZhqDGdhjg.png"><figcaption>Response from Gpt-4</figcaption></figure><h2 id="3dff">5] Data analysis on the US Unemployment Dataset.</h2><p id="218d">Gemini was given a dataset from <a href="https://www.kaggle.com/datasets/aniruddhasshirahatti/us-unemployment-dataset-2010-2020">Kaggle.com</a>, which contained US unemployment rates, and was tasked with generating an article.</p><figure id="6a55"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*kd5-KU3JyaqOlHB1H6zfsg.png"><figcaption>Screenshot from Bard’s console</figcaption></figure><p id="f012">Gemini and GPT-4 both provided useful information and helpful insights by reading the dataset.</p><p id="4c92">Here is the response from GPT-4:</p><figure id="f107"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*30xCwXs45RxAWKw3IePGRA.png"><figcaption>Response from GPT-4</figcaption></figure><h2 id="aad3">6] Summarization and Reading Comprehension</h2><p id="1793">Next, Gemini was tested on its ability to comprehend a paragraph about globalization and answer questions related to it.</p><p id="8d76">Prompt given: Here is a paragraph: <paragraph> …. </paragraph>. Now answer the following questions. Question 1: What is globalization and what are the driving forces behind it? Question 2: What are some positive aspects of globalization? Question 3: What are some negative consequences of

Options

globalization?</p><figure id="6e5e"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*baBwS-UPDbvMprFdQCYrqA.png"><figcaption>Screenshot from Bard’s console</figcaption></figure><p id="834d">Gemini gave correct answers to all the questions along with relevant explanations.</p><p id="ca68">Even models like GPT-4 and Claude, with higher context lengths, exhibit similar proficiency in delivering accurate responses with detailed reasoning.</p><h2 id="4e16">7] Science Q&A</h2><p id="3f3f">Various questions on science topics, including metabolic processes and physics, were posed in both single-line and multiple-choice formats. Gemini successfully provided accurate answers to all the questions.</p><figure id="2d75"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*zaCqLideAEpPUDDU5cTJew.png"><figcaption></figcaption></figure><figure id="196e"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*_FdYfgWipz7r6-r28pR1MQ.png"><figcaption>Screenshot from Bard’s console</figcaption></figure><h2 id="de83">8] Coding Ability</h2><p id="a067">Gemini was asked to write a Python program designed to accept user inputs. The task required creating a program that could interact with users.</p><p id="8fb0">“Create a Python function named find_greater_than() that prompts the user for two inputs: a list of numbers and an integer threshold. The function should generate a new list containing all numbers from the input list that exceed the specified threshold. The order of numbers in the resulting list should mirror that of the input list.”</p><figure id="aca9"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*nTA9dIR8YhTdNxJjxQxlmQ.png"><figcaption>Screenshot from Bard’s console</figcaption></figure><p id="5fca">Despite explicitly instructing Gemini to consider input from the user, it failed to understand that.</p><p id="0d90">On the contrary, when the same question was asked to GPT-4, it provided an ideal coding solution. The response was well-structured, with each line thoroughly explained through comments. GPT-4 flawlessly executed the task by taking user input and delivering the required solution with precision.</p><figure id="35ae"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*Ib2i9qXq_h6FOXRWwYxAQw.png"><figcaption></figcaption></figure><h2 id="f102">9] Understanding the Medical Domain</h2><p id="b9be">While Gemini appears well-informed about medications and their potential side effects, relying solely on AI recommendations would be unwise.</p><p id="444d">It’s important to approach AI-generated information with caution and seek advice from trusted human sources in matters related to your health and medications.</p><figure id="f546"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*-yFJ0DM4Kng5YxaL0g-_6Q.png"><figcaption>Screenshot from Bard’s console</figcaption></figure><h2 id="384d">Important Note: Questioning Google’s Line!</h2><p id="0138">Google has been marketing <a href="https://deepmind.google/technologies/gemini/#capabilities">Gemini as the most advanced AI model</a> and has shown evaluation graphs like the one below.</p><figure id="12b2"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*jFlHd9yXHH563ytRtGXcbQ.png"><figcaption>Image from <a href="https://deepmind.google/technologies/gemini/#capabilities">Gemini — Google DeepMind</a></figcaption></figure><p id="5805">Google showed Gemini Ultra is better than human experts, but the way they did it is questionable. They drew a line to show superiority without allowing proper verification of their specific prompting technique.</p><h1 id="fd07">Pricing</h1><figure id="18ce"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*B5Yw7Y4SEbQfaki7jVsxRQ.png"><figcaption></figcaption></figure><p id="d38b">The above graph shows the pricing for different language models per 1000 tokens, based on the prompt (input) and completion (output). <a href="https://blog.google/technology/ai/gemini-api-developers-cloud/">Gemini Pro costs 0.001 for the prompt and 0.002 for the completion per 1000 tokens.</a></p><p id="39ce">Gemini Pro and GPT-3.5 Turbo-16k may be preferable for more cost-sensitive applications, while GPT-4 models offer more advanced capabilities at a higher cost.</p><p id="7834">Whereas Claude-100k, despite having a higher token limit, has a relatively lower cost compared to the GPT-4 models.</p><h1 id="0f50">Conclusion</h1><p id="7e7c">To conclude, the Gemini-Pro model has many advantages, but it also requires some enhancements. Tested use cases provide insights into Gemini’s performance. While it excels in certain areas, such as summarization, reading comprehension, scientific Q&A, and analyzing data, it faces challenges in basic math and advanced math.</p><p id="b4a6">Gemini’s real-time data access is a notable feature, pulling information from various Google applications.</p><p id="fb1b">Gemini’s limitations become apparent in scenarios like image interpretation, where it struggles compared to GPT-4. Additionally, its coding ability falls short when compared to GPT-4. The comparison with the GPT-4 model indicates that Gemini might not outperform in every aspect, and its strengths and weaknesses vary across different tasks.</p><p id="e063">In essence, Gemini showcases notable capabilities, users and developers must consider its performance in specific use cases and carefully evaluate its suitability for diverse applications, keeping in mind the strengths and limitations highlighted in the analysis.</p><h2 id="cbca">About the Author</h2><p id="7d92">Vaishnavi R is a Junior Data Scientist at the <a href="https://www.version1.com/it-service/innovation-labs/">Version 1 AI Labs.</a></p></article></body>

How I Teleported Myself From California to London in Two Hours

Time traveling is a thing! Live in the open air!

Photo taken by Carmen Micsa

Teleporting, which is described as “the hypothetical transfer of matter or energy from one point to another without traversing the physical space between them,” has fascinated my daughter since she was little. She thought it was a more fun and faster way to travel than our 10-hour long family road trips. Not to mention, the best thing that could happen to mankind.

“I should invent a teleporting machine,” Sophia would declare with a sparkle in her blue eyes. “We could get anywhere quickly and not go on these boring road trips anymore.”

My husband and I understood that our kids were not as excited about road trips as we were, but our spacious and sturdy Toyota Sequoia was our only “teleporting machine,” until I discovered a way to time travel on a cold, foggy autumn afternoon.

After the best solo trip to London at the beginning of October, where I went to run London marathon, I was mesmerized with my traveling experience, as if someone had cast a magic spell on me. And that was quite possible, since all the schoolchildren looked as if they had descended from Harry Potter’s world full of magic and wizardry.

After my return to the US, I could not help reminiscing about London’s quaint Marylebone neighborhood, where I stayed in an Airbnb two bedroom-apartment, or flat, as the British would say. I particularly loved the red, white, and brown brick buildings that stood out like wildflowers in a meadow, the coquette restaurants and shops, the immaculate parks, the history, culture, tradition, and civilization.

My exploration of London reminded me that changing my world views while traveling from one continent to another meant that “the walls of my glass tunnel disappeared,” as British philosopher Derek Parfit said. Moreover, I appreciated how kind and genuine the British people were in helping me find my way around the tube stations and the streets of London.

I now live in the open air. There is still a difference between my life and the lives of other people. But the difference is less. Other people are closer. I am less concerned about the rest of my own life, and more concerned about the lives of others.” — Derek Parfit

And while taking pictures of Buckingham Palace, or shopping at The Harrods, felt magical, as Parfit posited, I returned back to America changed and more concerned about our environment, as well as the lives of others.

Time traveling

My mom is visiting us from Romania, and on this overcast and foggy November day, I found a way to time travel for a delectable London afternoon tea and share the experience with her.

Photo taken by one of the servers of Tea List in Davis, CA

Our afternoon tea started with us selecting the tea, after which the server brought us each a British scone that melted in our mouths like a snowflake. My mom particularly loved the Devonshire cream, also called Devon cream or clotted cream, a common dairy product in England — not recommended to someone on a diet — as well as the lemon curd that we topped our scones with.

The white tablecloth. The tea. The scones. The finger sandwiches. The fresh fruit and pastries. The experience.

They all took me back to my fancy afternoon tea at The Rubens and made me smile, realizing that our creativity and imagination can help us travel in time while traversing continents, countries, and oceans.

A royal afternoon tea at The Rubens in London, the day after I ran London Marathon

Almost a month after returning home from my wonderful and successful solo international trip, I decided to time travel back for some afternoon tea, and share the experience with my mom, who was not as impressed by my teleporting abilities. She also did not share my enthusiasm to relive the simple and yet complex act of drinking royal tea paired with scones and finger sandwiches, but she was definitely amused by it.

Photo taken by Carmen Micsa in London

Time travel past and future

Since Charles Dickens depicts mystical time travel in both directions in A Christmas Carol, as the protagonist, Ebenezer Scrooge, is transported to Christmases past and future, I figured that I could do the same by recreating the afternoon tea scene matching my London experience.

Time travel in the past and future can happen when we are imaginative and creative, but mainly when we believe in teleporting ourselves mentally and spiritually to the destination of our choice.

I see my past and future time travel ability as a great way to satisfy my penchant for traveling abroad to experience different cultures, traditions, and new sights.

Ways to make it happen:

  1. Travel to the destination first.
  2. Learn as much as possible about the country and its people.
  3. Live like a local when visiting. I loved going to Waitrose local supermarket, which dazzled me with their fresh and organic products. I felt like Alice in Wonderland just shopping there.
  4. Return home and recreate the experience.

And although my mom was not as enthused and elated about my time travel, our afternoon tea brought us closer together and will create everlasting memories of drinking tea and scarfing down scones topped with lemon curd and Devonshire cream — an experience that we can sip on while straddling two different cultures, continents, and generations.

And, trust me! It’s worth it! Every bite and moment of it!

Literary Impulse
Traveling
Personal Growth
Philosophy
Relationships
Recommended from ReadMedium