
Keras 3.0 Is Out. Here Is What You Must Know

This is a preview version and is planned for a full release in the Fall of 2023!

Salient points!

Keras 3.0 is a full rewrite of the codebase
The backend is now modular
It can run on top of TensorFlow, JAX, or PyTorch.

For now, the new Keras is distributed as keras_core. Using import keras_core as keras in place of from tensorflow import keras is all you need. It should run the same code without changes.
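A minimal sketch of the import swap. The backend is selected through the KERAS_BACKEND environment variable, which has to be set before the import. The try/except guard is my addition, only there so the snippet also runs on a machine where keras_core is not installed.

```python
import os

# Pick the backend before importing keras_core; it cannot be switched afterwards.
# "tensorflow", "jax" and "torch" are the three backends named in this article.
os.environ.setdefault("KERAS_BACKEND", "tensorflow")

try:
    import keras_core as keras  # replaces: from tensorflow import keras
except ImportError:
    keras = None  # keras_core (or the chosen backend) is not installed here

print("Requested backend:", os.environ["KERAS_BACKEND"])
print("keras_core available:", keras is not None)
```

Everything after the import can keep using the keras name, so the rest of a script does not need to know which package provided it.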

History of Keras

Before 2018, Keras was multi-backend and could run on Theano, TensorFlow, CNTK, and MXNet. The Keras team then decided to focus on the TensorFlow backend only, as TF was the most commonly used backend and was becoming universal.

According to the 2023 Stack Overflow Developer Survey and the 2022 Kaggle Machine Learning & Data Science Survey, TensorFlow holds between 55% and 60% of the market share and PyTorch 40–45%. JAX has a smaller share, but it is the go-to backend of Google DeepMind, Midjourney, Cohere, and some other GenAI projects.

Main Features of Keras Core

Cross-framework low-level language implementation for Deep Learning

Deep learning layers and pre-trained models created using keras_core will work exactly the same way in any supported framework. In particular, the keras_core.ops namespace is a cross-framework space that contains:

A near-full implementation of the NumPy API: this includes critical functions like ops.matmul, ops.sum, ops.stack, ops.einsum, etc.
Neural-network-specific functions like ops.softmax, ops.binary_crossentropy, ops.conv, etc.

You can develop custom components using keras_core and then deploy them on whichever backend works for you, or you can stick with a single framework of your choice.
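As a rough illustration of a backend-agnostic component, the sketch below uses only ops.matmul, ops.transpose, ops.softmax, and ops.ones. The function name similarity_weights, the shapes, and the choice of the JAX backend are made up for the example. The import is guarded so the snippet still runs where keras_core is missing.

```python
import os

# Assumption for this sketch: use JAX unless a backend is already configured.
os.environ.setdefault("KERAS_BACKEND", "jax")

try:
    from keras_core import ops
except ImportError:
    ops = None  # keras_core (or the chosen backend) is not installed here


def similarity_weights(queries, keys):
    """Row-wise softmax over query/key dot products (hypothetical helper)."""
    scores = ops.matmul(queries, ops.transpose(keys))  # shape (n_queries, n_keys)
    return ops.softmax(scores, axis=-1)


weights = None
if ops is not None:
    queries = ops.ones((2, 3))  # 2 queries with 3 features each
    keys = ops.ones((4, 3))     # 4 keys with 3 features each
    weights = similarity_weights(queries, keys)
    print(tuple(weights.shape))
```

Because the function only touches the ops namespace, the same source should run whichever backend KERAS_BACKEND points at.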

Additionally, Keras models plug into low-level, framework-native workflows:

JAX training loop to train a Keras model using an optax optimizer, jax.grad, jax.jit, jax.pmap.
TensorFlow training loop to train a Keras model using tf.GradientTape and tf.distribute.
Low-level PyTorch training loop to train a Keras model using a torch.optim optimizer, a torch loss function, and the torch.nn.parallel.DistributedDataParallel wrapper.
Use a Keras layer or model as part of a torch.nn.Module.

This means that PyTorch users can start leveraging Keras models whether or not they use Keras APIs! You can treat a Keras model just like any other PyTorch Module.

The same cross-framework approach works for data pipelines. All backends accept:

tf.data.Dataset pipelines: the reference for scalable production ML.
torch.utils.data.DataLoader objects.
NumPy arrays and Pandas data frames.
keras_core.utils.PyDataset objects.

keras_core.applications namespace

keras_core.applications is the namespace where 40 Keras application models are available on all the backends. The vast array of pre-trained models in KerasCV and KerasNLP (e.g. BERT, T5, YOLOv8, Whisper, etc.) also works with all backends.

Edit what you want incrementally

Progressive disclosure of complexity is a design principle at the core of Keras. You can start with simpler workflows like Sequential models and then, when you need more flexibility, override a function with a different component. That means you keep most of the same pipeline and override a single function.

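The following is my own sketch of that idea, not the article's original figure. It starts with a Sequential model, then subclasses keras.Model and overrides only call(), while compile() stays exactly the same for both. The class name ScaledModel, the layer sizes, and the TensorFlow default are illustrative assumptions.

```python
import os

os.environ.setdefault("KERAS_BACKEND", "tensorflow")  # assumed default for the sketch

try:
    import keras_core as keras
except ImportError:
    keras = None  # keras_core (or the chosen backend) is not installed here

models = []
if keras is not None:
    # Step 1: the simplest workflow, a stack of layers.
    simple = keras.Sequential(
        [keras.layers.Dense(8, activation="relu"), keras.layers.Dense(1)]
    )

    # Step 2: more flexibility by overriding a single method.
    class ScaledModel(keras.Model):
        """Same layers as above, but the forward pass is written by hand."""

        def __init__(self, scale=0.5):
            super().__init__()
            self.hidden = keras.layers.Dense(8, activation="relu")
            self.out = keras.layers.Dense(1)
            self.scale = scale

        def call(self, inputs):
            # Custom logic lives here; training code does not change.
            return self.out(self.hidden(inputs)) * self.scale

    models = [simple, ScaledModel()]
    for model in models:
        # The rest of the pipeline is identical for both models.
        model.compile(optimizer="adam", loss="mse")
```

Both models can then be passed to the same fit() and evaluate() calls.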

Stateless API to work with JAX

Older Keras was entirely stateful. On every update during training and evaluation, the variables were modified in place, and there was no way to pass that state in or get it back explicitly. The stateless API exposes it, which makes Keras objects usable inside JAX functions, which are required to be fully stateless.

The stateless API is available for all layers, models, metrics, and optimizers:

All layers and models have a stateless_call() method which mirrors __call__().
All optimizers have a stateless_apply() method which mirrors apply().
All metrics have a stateless_update_state() method which mirrors update_state() and a stateless_result() method which mirrors result().

This does not change the way we all have been using Keras. Existing code stays the same, but the objects it creates can now be used with JAX, without affecting TensorFlow or PyTorch.
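To make the stateful/stateless distinction concrete, here is a toy stand-in written in plain Python. TinyScale is a hypothetical class, not Keras code. The argument order of its stateless_call (trainable variables, non-trainable variables, inputs) and the (outputs, updated non-trainable variables) return value follow my reading of the Keras documentation.

```python
class TinyScale:
    """Toy 'layer' with one trainable weight and one non-trainable call counter."""

    def __init__(self):
        self.weight = 2.0  # trainable state
        self.calls = 0     # non-trainable state

    def call(self, x):
        # Stateful: reads and mutates attributes stored on the object.
        self.calls += 1
        return self.weight * x

    def stateless_call(self, trainable_variables, non_trainable_variables, x):
        # Stateless: all state comes in as arguments and goes out as return values.
        (weight,) = trainable_variables
        (calls,) = non_trainable_variables
        return weight * x, [calls + 1]


layer = TinyScale()

y_stateful = layer.call(3.0)  # side effect: layer.calls is now 1

# Pure function of its inputs: the object itself is left untouched.
y_stateless, new_state = layer.stateless_call([layer.weight], [0], 3.0)
print(y_stateful, y_stateless, new_state, layer.calls)
```

The second form is the shape that transformations such as jax.jit and jax.grad expect: nothing is updated behind the scenes, so the caller decides what to do with the returned state.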

Still a pre-release: What's not working

The import order is messed up: LOL, in true package-import fashion, you have to import torch BEFORE tensorflow. If you import tensorflow before torch, it can crash.

Integer dtypes with PyTorch: torch does not support uint16 or uint32. The backend falls back to int32 and int64 to keep torch compatible with JAX and TensorFlow.
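A quick arithmetic check of why wider signed types are a safe substitute. The pairing uint16 to int32 and uint32 to int64 is my assumption about how the fallback lines up. The helper names are made up for the example.

```python
def int_range(bits, signed):
    """Smallest and largest value representable with the given bit width."""
    if signed:
        return -(2 ** (bits - 1)), 2 ** (bits - 1) - 1
    return 0, 2 ** bits - 1


def fits(unsigned_bits, signed_bits):
    """True if every unsigned value also exists in the signed type."""
    low_u, high_u = int_range(unsigned_bits, signed=False)
    low_s, high_s = int_range(signed_bits, signed=True)
    return low_s <= low_u and high_u <= high_s


# Assumed pairing: (unsigned bit width, signed bit width of the fallback).
FALLBACK = {"uint16 -> int32": (16, 32), "uint32 -> int64": (32, 64)}

for label, (unsigned_bits, signed_bits) in FALLBACK.items():
    print(label, "lossless:", fits(unsigned_bits, signed_bits))

# A same-width signed type would not be enough:
print("uint32 -> int32 lossless:", fits(32, 32))
```

The cost of the fallback is memory: each value takes twice as many bits as the unsigned original.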

Average Pooling issue: Torch's average pooling has no "same" padding option, so pooling layers may produce outputs that differ from TF.
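Why padding affects layer dimensions can be seen from the output-length arithmetic. The sketch below uses the TensorFlow/Keras conventions for "valid" and "same" padding along one axis. It says nothing about what torch does internally. The function name pooled_length is made up for the example.

```python
import math


def pooled_length(n, pool_size, stride, padding):
    """Output length along one axis under TensorFlow/Keras padding conventions."""
    if padding == "valid":
        # No padding: only windows that fit entirely inside the input count.
        return (n - pool_size) // stride + 1
    if padding == "same":
        # Input is padded so that every stride position yields an output.
        return math.ceil(n / stride)
    raise ValueError(f"unknown padding mode: {padding!r}")


n, pool_size, stride = 7, 2, 2
print("valid:", pooled_length(n, pool_size, stride, "valid"))
print("same: ", pooled_length(n, pool_size, stride, "same"))
```

With an odd input length the two modes disagree by one element, which is the kind of mismatch that shows up when a backend has to emulate a padding mode it does not provide.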

Using .map() with tf.data pipeline: Calling Keras layers inside a tf.data .map() works only with the TensorFlow backend, not with any other backend.

Image layers with channels first or last: Only torch uses channels_first; the other frameworks use channels_last. To stay compatible, keras_core has to keep swapping channels_first to channels_last or vice versa. This costs compute efficiency.
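To show what such a swap involves, here is a plain-Python sketch that converts a single image between a (channels, height, width) layout and a (height, width, channels) layout using nested lists. The function names are made up for the example. Real backends do this with a tensor transpose, but every element still has to be re-addressed.

```python
def to_channels_last(image):
    """Convert a (C, H, W) nested list into (H, W, C)."""
    channels, height, width = len(image), len(image[0]), len(image[0][0])
    return [
        [[image[c][h][w] for c in range(channels)] for w in range(width)]
        for h in range(height)
    ]


def to_channels_first(image):
    """Convert an (H, W, C) nested list into (C, H, W)."""
    height, width, channels = len(image), len(image[0]), len(image[0][0])
    return [
        [[image[h][w][c] for w in range(width)] for h in range(height)]
        for c in range(channels)
    ]


# Two channels, each a 2x2 image.
chw = [[[1, 2], [3, 4]], [[5, 6], [7, 8]]]
hwc = to_channels_last(chw)
print(hwc)
print(to_channels_first(hwc) == chw)
```

Doing this on every layer boundary is where the lost compute efficiency comes from.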

Sparse NN support: There is no support for sparse types. It is planned for the future.

If you have read it until this point — Thank you! You are a hero (and a Nerd ❤)! I try to keep my readers up to date with “interesting happenings in the AI world,” so please 🔔 clap | follow | Subscribe 🔔

Become a member using the referral: https://ithinkbot.com/membership

Find me on Linkedin https://www.linkedin.com/in/mandarkarhade/
