avatarChristianlauer

Free AI web copilot to create summaries, insights and extended knowledge, download it at here

1306

Abstract

<div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*HEPyS7jwAgqyQAgdLyQCaw.jpeg)"></div> </div> </div> </a> </div><p id="9c77">Now, Google has updated the <a href="https://cloud.google.com/bigquery/docs/load-data-using-cross-cloud-transfer"><code>INSERT INTO SEL</code>ECT statement</a>. You can now filter data from files in Amazon S3 and Azure Blob Storage and append it into BigQuery tables[1].</p><p id="c159">So if you want to realize your data integration via SQL and via BigLake in BigQuery, which is of course a simple practical application case as it is easy to implement, this command of will offer you some new possibilities. You can store a command logic in BigQuery for the respective table or file in the S3 or Azure storage. The logic you would use for this is down below[2]:</p><div id="a028"><pre><span class="hljs-keyword">SELECT</span> table_name, ddl <span class="hljs-keyword">FROM</span> `myproject`.myawsdataset.INFORMATION_SCHEMA.TABLES <span class="hljs-keyword">WHERE</span> table_name <span class="hljs-operator">=</span> <span class="hljs-string">'orders'</span>;</pre></div><p id="19a1">For further information, example code and limitations please use the official Google <a href="https://cloud.g # Options oogle.com/bigquery/docs/load-data-using-cross-cloud-transfer#filter-data">source</a> linked below.</p><div id="acdd" class="link-block"> <a href="https://readmedium.com/upload-data-to-bigquery-via-python-bd31a8e61743"> <div> <div> <h2>Upload Data to BigQuery via Python</h2> <div><h3>How to work with BigQuery by using a Python Notebook</h3></div> <div><p>medium.com</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*4jy7jHnz9Z8WpwQa1ANeqQ.jpeg)"></div> </div> </div> </a> </div><p id="bf22">It will be interesting to see how this topic develops and whether other cloud providers and services will soon be integrated in this way, but this change should certainly mean easier data integration for Data Engineers.</p><h2 id="8e29">Sources and Further Readings</h2><p id="7ced">[1] Google, <a href="https://cloud.google.com/bigquery/docs/release-notes">BigQuery release notes</a> (2023)</p><p id="fdac">[2] Google, <a href="https://cloud.google.com/bigquery/docs/load-data-using-cross-cloud-transfer#filter-data">Load data with cross-cloud operations</a> (2023)</p></article></body>

Google launches better Cross Cloud Operations for BigQuery

How Google BigQuery now lets you filter Data with Insert- Into-Select

Photo by Florian Schneider on Unsplash

Google continues to expand its integration of BigQuery with other systems such as AWS S3 and Azure Blob Storage. For some time now, users have been able to use Google BigLake to analyze platform-independent data using a zero ETL approach or to integrate it using a fixed process.

Now, Google has updated the INSERT INTO SELECT statement. You can now filter data from files in Amazon S3 and Azure Blob Storage and append it into BigQuery tables[1].

So if you want to realize your data integration via SQL and via BigLake in BigQuery, which is of course a simple practical application case as it is easy to implement, this command of will offer you some new possibilities. You can store a command logic in BigQuery for the respective table or file in the S3 or Azure storage. The logic you would use for this is down below[2]:

SELECT
  table_name, ddl
FROM
  `myproject`.myawsdataset.INFORMATION_SCHEMA.TABLES
WHERE
  table_name = 'orders';

For further information, example code and limitations please use the official Google source linked below.

It will be interesting to see how this topic develops and whether other cloud providers and services will soon be integrated in this way, but this change should certainly mean easier data integration for Data Engineers.

Sources and Further Readings

[1] Google, BigQuery release notes (2023)

[2] Google, Load data with cross-cloud operations (2023)

Data Science
Google
Bigquery
Programming
Technology
Recommended from ReadMedium