AutoKeras Tutorial: Automated Deep Learning with Keras

You know what’s exhausting? Spending three days tweaking neural network architectures, trying different layer combinations, adjusting hyperparameters, and still ending up with mediocre results. I’ve been there, staring at my screen at 2 AM, wondering if adding another dropout layer will finally push my accuracy above 85%.

Enter AutoKeras. It’s basically having a deep learning expert sitting next to you, automatically testing architectures and hyperparameters while you grab coffee. No PhD required, no endless experimentation sessions, just clean results. Sounds too good to be true? Let me show you why it’s not.

What Exactly Is AutoKeras?

AutoKeras is an AutoML library built on top of Keras that automatically searches for the best neural network architecture for your specific problem. Think of it as a smart assistant that knows all the tricks you’d spend months learning.

The library uses something called Neural Architecture Search (NAS) to explore different model configurations. Instead of you manually deciding “should I use 64 or 128 neurons in this layer?”, AutoKeras tries multiple combinations and picks the winner.

I remember my first AutoKeras project. I had a computer vision task that I’d been manually tuning for a week. Out of curiosity, I let AutoKeras run overnight. The next morning? It had found an architecture that beat my hand-tuned model by 7%. That stung a little, not gonna lie :)

Why Use AutoKeras Instead of Manual Design?

Saves massive amounts of time on architecture search
Explores configurations you might never think of on your own
Handles preprocessing automatically (scaling, encoding, etc.)
Works for beginners and experts alike
Completely open-source and actively maintained

Ever wondered why your carefully crafted network performs worse than something AutoKeras spits out in an hour? Yeah, that’s the power of automated search at work.

Installation and Setup

Getting started is dead simple. You’ll need Python 3.7+ and TensorFlow 2.3+:

bash

pip install autokeras

That’s it. Seriously. No complicated dependencies or configuration files. The package handles everything for you.

Here’s your basic import setup:

python

import autokeras as ak
import tensorflow as tf
import numpy as np

One quick heads-up: AutoKeras can be memory-intensive during the search process. If you’re on a laptop with limited RAM, you might want to reduce the number of trials (I’ll show you how later).

Your First AutoKeras Model: Image Classification

Let me walk you through the simplest possible example. We’ll use the CIFAR-10 dataset because it’s readily available and perfect for testing.

Loading and Preparing Data

python

from tensorflow.keras.datasets import cifar10

# Load the data
(x_train, y_train), (x_test, y_test) = cifar10.load_data()

# AutoKeras handles normalization, but let's check shapes
print(f"Training data shape: {x_train.shape}")
print(f"Training labels shape: {y_train.shape}")

Notice anything? I’m not normalizing the images or doing any preprocessing. AutoKeras figures this out automatically. That alone saves you from countless “wait, did I forget to normalize?” moments.

Training the Model

Here’s where the magic happens:

python

# Initialize the image classifier
clf = ak.ImageClassifier(
    max_trials=10,
    overwrite=True,
    directory='autokeras_output'
)

# Train it
clf.fit(x_train, y_train, epochs=30, validation_split=0.2)

# Evaluate
accuracy = clf.evaluate(x_test, y_test)
print(f"Test Accuracy: {accuracy[1]}")

That’s it. You just trained a deep learning model without specifying:

Number of layers
Filter sizes
Activation functions
Optimizer settings
Learning rate schedules

AutoKeras tested 10 different architectures and picked the best one. The max_trials parameter controls how many architectures it explores—more trials generally mean better results but longer training time.

What’s Happening Under the Hood?

During training, you’ll see AutoKeras trying different configurations. It’s using Bayesian optimization to intelligently explore the search space rather than randomly trying everything.

Each trial builds on what it learned from previous ones. If adding a batch normalization layer helped in trial 3, trials 4 and 5 might experiment with where to place it. It’s actually pretty clever when you watch it work.

Text Classification Made Easy

Let’s tackle something different — sentiment analysis. I recently used AutoKeras for a customer review classification project, and the results were impressive.

Preparing Text Data

python

import pandas as pd

# Sample data (in reality, load your dataset)
texts = [
    "This product is amazing! Best purchase ever.",
    "Terrible quality. Complete waste of money.",
    "It's okay, nothing special.",
    # ... more examples
]
labels = [1, 0, 2, ...]  # positive, negative, neutral

# Convert to numpy arrays
x_train = np.array(texts)
y_train = np.array(labels)

Building the Text Classifier

python

# Initialize text classifier
text_clf = ak.TextClassifier(
    max_trials=15,
    overwrite=True
)

# Train
text_clf.fit(
    x_train, 
    y_train,
    epochs=20,
    validation_split=0.2
)

# Make predictions
predictions = text_clf.predict(["This is a test review"])

AutoKeras automatically handles:

Text tokenization
Embedding selection (might try different embedding dimensions)
Sequence processing (LSTMs, CNNs, or Transformers)
Output layer configuration

FYI, the library is smart enough to recognize when you’re working with text and applies appropriate preprocessing without you having to specify anything.

Structured Data: The Bread and Butter

Most real-world problems involve tabular data. AutoKeras handles this beautifully with the StructuredDataClassifier.

Working with Tabular Data

python

import pandas as pd

# Load your dataset
df = pd.read_csv('your_data.csv')

# Separate features and target
X = df.drop('target_column', axis=1)
y = df['target_column']

# Initialize classifier
clf = ak.StructuredDataClassifier(
    max_trials=20,
    overwrite=True
)

# Train
clf.fit(
    X, 
    y,
    epochs=50,
    validation_split=0.15
)

Here’s what blows my mind: AutoKeras automatically detects column types. It figures out which columns are categorical, which are numerical, and applies appropriate preprocessing to each. No manual encoding required.

Feature Engineering on Autopilot

Traditional feature engineering is tedious. Should you normalize? Standardize? One-hot encode? With AutoKeras, the answer is “let the algorithm decide.”

The library tries different preprocessing combinations:

Normalization vs. standardization for numerical features
One-hot encoding vs. embedding for categorical features
Feature interactions (sometimes it combines features creatively)

It’s like having a data scientist who never gets tired testing different approaches.

Customizing the Search Space

Sometimes you want some control without going full manual mode. AutoKeras lets you guide the search while still automating most decisions.

Limiting Search Parameters

python

clf = ak.ImageClassifier(
    max_trials=15,
    overwrite=True,
    objective='val_accuracy',  # What to optimize for
    max_model_size=10000000,   # Limit model size (bytes)
    tuner='bayesian',          # Search strategy
    seed=42                    # Reproducibility
)

The objective parameter is particularly useful. You might prioritize:

val_accuracy for classification tasks
val_loss for regression
val_precision or val_recall for imbalanced datasets

Setting Resource Constraints

Got limited time or compute? Constraint the search:

python

clf = ak.ImageClassifier(
    max_trials=5,              # Fewer architectures
    epochs=10,                 # Faster training per trial
    overwrite=True
)

# Or use time-based limits
import time

start_time = time.time()
max_search_time = 3600  # 1 hour in seconds

clf.fit(
    x_train, 
    y_train,
    callbacks=[
        tf.keras.callbacks.TimedStopping(
            seconds=max_search_time
        )
    ]
)

This is crucial for real-world projects where you can’t just let the search run indefinitely. I usually start with 5–10 trials to get a baseline, then increase if results aren’t satisfactory.

Advanced Features Worth Knowing

Multi-Input and Multi-Output Models

AutoKeras handles complex architectures surprisingly well. Need to process images AND metadata together?

python

# Define inputs
image_input = ak.ImageInput()
structured_input = ak.StructuredDataInput()

# Define output
output = ak.ClassificationHead()

# Build the multi-modal model
model = ak.AutoModel(
    inputs=[image_input, structured_input],
    outputs=output,
    max_trials=10
)

# Train with multiple inputs
model.fit(
    [image_data, structured_data],
    labels,
    epochs=30
)

I used this approach for a medical imaging project where we combined X-ray images with patient demographics. The multi-modal architecture AutoKeras found outperformed our separate models by a significant margin.

Export and Deployment

Once you’ve found your winning model, you’ll want to use it in production:

python

# Export the best model as Keras
best_model = clf.export_model()

# Save it
best_model.save('my_autokeras_model.keras')

# Load later for inference
loaded_model = tf.keras.models.load_model('my_autokeras_model.keras')
predictions = loaded_model.predict(new_data)

The exported model is a standard Keras model, so it works with all TensorFlow deployment tools — TF Serving, TF Lite, you name it.

Common Pitfalls and How to Avoid Them

Let me save you some frustration based on mistakes I’ve definitely not made repeatedly :/

1. Running Too Many Trials on Limited Hardware

Don’t do this:

python

clf = ak.ImageClassifier(max_trials=100)  # Your laptop will cry

Start small, scale up:

python

clf = ak.ImageClassifier(max_trials=5)   # Test the waters first

2. Ignoring the Validation Split

AutoKeras needs validation data to compare architectures. If you don’t provide it, results might not generalize:

python

# GOOD
clf.fit(x_train, y_train, validation_split=0.2)

# BETTER - use your own validation set
clf.fit(x_train, y_train, validation_data=(x_val, y_val))

3. Not Setting overwrite=True

Forgetting this causes AutoKeras to reuse results from previous runs, which can lead to really confusing behavior when you’re iterating:

python

clf = ak.ImageClassifier(
    max_trials=10,
    overwrite=True  # ALWAYS include this during development
)

4. Expecting Miracles with Bad Data

AutoKeras is powerful, but it can’t fix fundamentally flawed datasets. Garbage in, garbage out still applies. If your data has:

Class imbalance issues (address with class weights)
Data leakage problems (fix your train/test split)
Insufficient examples (collect more data)

No amount of AutoML will save you.

AutoKeras vs. Manual Architecture Design

Real talk: when should you use AutoKeras versus building models manually?

Use AutoKeras when:

You’re starting a new project and need a strong baseline
You don’t have deep expertise in architecture design
You want to explore unconventional architectures
Time is limited and you need results quickly

Manual design when:

You need fine-grained control over every layer
You’re implementing cutting-edge research architectures
Interpretability of the architecture is critical
You’re optimizing for specific hardware constraints

IMO, the best approach is hybrid: use AutoKeras to find a great baseline, then manually refine if needed. You get the benefits of automated search plus your domain expertise.

Performance Tuning Tips

Want to squeeze more performance out of AutoKeras? Here’s what actually works:

Increase Trials Strategically

python

# Progressive approach
# Start: 5 trials (1-2 hours)
# If promising: 15 trials (overnight)
# If still improving: 30+ trials (weekend run)

More trials = better results, but returns diminish after a point.

Use Data Augmentation

For image tasks, augmentation helps tremendously:

python

from tensorflow.keras.preprocessing.image import ImageDataGenerator

datagen = ImageDataGenerator(
    rotation_range=20,
    width_shift_range=0.2,
    height_shift_range=0.2,
    horizontal_flip=True
)

# Then use with AutoKeras
clf.fit(datagen.flow(x_train, y_train), epochs=30)

Monitor with TensorBoard

AutoKeras integrates with TensorBoard for visualization:

python

tensorboard_callback = tf.keras.callbacks.TensorBoard(
    log_dir='./logs'
)

clf.fit(
    x_train, 
    y_train,
    callbacks=[tensorboard_callback]
)

Watch your trials progress in real-time. It’s oddly satisfying watching different architectures compete.

Final Thoughts

AutoKeras has genuinely changed how I approach deep learning projects. The days of spending weeks manually architecting networks for every task? Pretty much over. Now I let AutoKeras handle the grunt work while I focus on data quality, problem formulation, and interpreting results.

Is it perfect? No. You’ll occasionally want more control than it offers. The search process can be time-consuming. And sometimes — sometimes — a simple manually-designed model is all you need.

But for the vast majority of projects? AutoKeras delivers excellent results with minimal effort. It democratizes deep learning in a way that’s genuinely exciting. You don’t need to be a neural architecture expert to build competitive models anymore.

Start with a small project. Let AutoKeras run through its search process. Compare the results to what you’d build manually. I’m betting you’ll be impressed, maybe even a little annoyed at how well it works. Welcome to the AutoML revolution, friend. Your 2 AM hyperparameter tuning sessions are officially optional now.

Sam Austin

Search This Blog

Latest Post

Reinforcement Learning for Credit Scoring: Applications in Fintech