# Import necessary libraries
import pandas as pd
import tensorflow as tf
import matplotlib.pyplot as plt
import requests
import numpy as np
from tensorflow.keras.utils import plot_model
import datetime
%load_ext tensorboard
# Call AlphaVantage api to fetch "SPY" index data
api_key = 'OAK4S5QP4CT156KH'  # replace with your own (free) Alpha Vantage API key
ticker = "SPY"
url = 'https://www.alphavantage.co/query'
params = {'function': 'TIME_SERIES_DAILY_ADJUSTED', 'symbol': ticker,
          'apikey': api_key, 'outputsize': 'full'}
response = requests.get(url=url, params=params)
json_data = response.json()['Time Series (Daily)']
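Note: Alpha Vantage's free tier is rate limited; when the limit is hit, the JSON carries a 'Note' (or 'Error Message') key instead of price data, and the lookup above raises a KeyError. A more defensive version of the same lookup (a sketch, not required for the rest of the notebook) might be:
payload = response.json()
if 'Time Series (Daily)' not in payload:
    # Typically a rate-limit 'Note' or an 'Error Message' from the API
    raise RuntimeError(f'Unexpected API response: {payload}')
json_data = payload['Time Series (Daily)']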
# Read json data into pandas dataframe
df = pd.DataFrame(json_data).transpose()
df.columns = [c.split('. ')[1] for c in df.columns]  # Clean up column names (e.g. '4. close' -> 'close')
# convert data to floats and reverse order: older to newer ticker prices
df = df.apply(pd.to_numeric, errors='coerce')
df = df.reindex(index=df.index[::-1])
# Add Bollinger-style support and resistance bands (+/- 2 std devs around a 30 day moving average)
df['ma_close'] = df['close'].rolling(window=5).mean()  # 5 day moving average of close (the prediction target)
ma = df['close'].rolling(window=30).mean()
stdev = df['close'].rolling(window=30).std()
df['upper_boll'] = ma + 2*stdev
df['lower_boll'] = ma - 2*stdev
# Add 5 day percent change
df['5_day_pct_change'] = df['close'].pct_change(5)
# Add future 5 day returns
df['future_5_return'] = df['5_day_pct_change'].shift(-5)
# Add 5 day volume change
df['5_day_vol_change'] = df['volume'].pct_change(5)
# Add 1 and 5 day momentum
close = df['close']
close_past_5 = df['close'].shift(5)
close_past_1 = df['close'].shift(1)
df['momentum_1'] = close - close_past_1
df['momentum_5'] = close - close_past_5
# drop null rows
df.dropna(axis=0,inplace=True)
df.head()
# Quick plot to visualize support and resistance bands
plt.figure(figsize=(12,8))
plt.plot(df['close'].iloc[-500:], color='g', label='close')
plt.plot(df['ma_close'].iloc[-500:], color='black', label='5 day MA')
plt.plot(ma.iloc[-500:], color='b', label='30 day MA')
plt.plot(df['upper_boll'].iloc[-500:], color='r', label='upper band')
plt.plot(df['lower_boll'].iloc[-500:], color='r', label='lower band')
plt.legend(loc='upper left')
plt.show()
# extract main features to train on
features_considered = ['ma_close', 'volume', 'upper_boll', 'lower_boll',
                       '5_day_pct_change', '5_day_vol_change', 'momentum_1', 'momentum_5']
features = df[features_considered]
# create train test split vectors
TRAIN_SPLIT = 4000
dataset = features.values
# Normalize features in dataset based on training set mean and std
data_mean = dataset[:TRAIN_SPLIT].mean(axis=0)
data_std = dataset[:TRAIN_SPLIT].std(axis=0)
dataset = (dataset-data_mean)/data_std
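As a quick, purely illustrative sanity check, the training slice should now have roughly zero mean and unit standard deviation in every column (the validation slice will differ slightly, since it was normalized with the training statistics):
# After z-scoring with the training statistics, the training slice should be
# centered near 0 with std near 1 in each feature column
print(dataset[:TRAIN_SPLIT].mean(axis=0).round(3))
print(dataset[:TRAIN_SPLIT].std(axis=0).round(3))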
# Rows: days
# Columns: features (ex. ma, momentum, upper_bol etc...)
print(dataset)
print(f'Data Dimensions: {dataset.shape}')
Credit: the `multivariate_data` windowing function below is adapted from https://www.tensorflow.org/tutorials/structured_data/time_series
def multivariate_data(dataset, target, start_index, end_index, history_size,
                      target_size, step, single_step=False):
    """Slice the dataset into (history window, future target) training pairs."""
    data = []
    labels = []
    # The first window needs history_size past observations available
    start_index = start_index + history_size
    if end_index is None:
        end_index = len(dataset) - target_size
    for i in range(start_index, end_index):
        # Indices of the past history_size time steps, sampled every `step` days
        indices = range(i-history_size, i, step)
        data.append(dataset[indices])
        if single_step:
            labels.append(target[i+target_size])
        else:
            labels.append(target[i:i+target_size])
    return np.array(data), np.array(labels)
# How many time steps in the past we want to look at to make our prediction
past_history = 30
# How many time steps into the future we want to forecast
future_target = 5
# When set to 1, the function will slide the 30 day window by 1 time step each iteration
STEP = 1
x_train_multi, y_train_multi = multivariate_data(dataset, dataset[:, 0], 0,
TRAIN_SPLIT, past_history,
future_target, STEP)
x_val_multi, y_val_multi = multivariate_data(dataset, dataset[:, 0],
TRAIN_SPLIT, None, past_history,
future_target, STEP)
x_train_multi.shape  # (samples, past_history, num features)
BUFFER_SIZE = 3900
BATCH_SIZE = 360
# Taking x_train and y_train numpy arrays and converting them to a tensorflow dataset
train_data_multi = tf.data.Dataset.from_tensor_slices((x_train_multi, y_train_multi))
# Shuffling and batching data
train_data_multi = train_data_multi.cache().shuffle(BUFFER_SIZE).batch(BATCH_SIZE).repeat()
val_data_multi = tf.data.Dataset.from_tensor_slices((x_val_multi, y_val_multi))
val_data_multi = val_data_multi.batch(BATCH_SIZE).repeat()
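Because both datasets repeat indefinitely, model.fit later needs explicit step counts. The values used there (steps_per_epoch=11, validation_steps=2) can be derived from the window counts rather than hardcoded; a small sketch (exact numbers depend on how much history the API returned):
# One epoch ~ one pass over the training windows; with the split above there
# are 4000 - 30 = 3970 training windows, and 3970 // 360 = 11 steps per epoch
print('steps per epoch:', len(x_train_multi) // BATCH_SIZE)
print('validation steps:', len(x_val_multi) // BATCH_SIZE)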
# Defining plotting functions
def create_time_steps(length):
return list(range(-length, 0))
#Plots actual vs. predicted values
def multi_step_plot(history, true_future, prediction):
plt.figure(figsize=(12, 6))
num_in = create_time_steps(len(history))
num_out = len(true_future)
plt.plot(num_in, np.array(history[:, 0]), label='History')
plt.plot(np.arange(num_out)/STEP, np.array(true_future), 'bo',
label='True Future')
if prediction.any():
plt.plot(np.arange(num_out)/STEP, np.array(prediction), 'ro',
label='Predicted Future')
plt.legend(loc='upper left')
plt.show()
# Plots training and validation error (MAE)
def plot_train_history(history, title):
    # We track MAE as the metric, so plot the 'mae'/'val_mae' history keys
    mae = history.history['mae']
    val_mae = history.history['val_mae']
    epochs = range(len(mae))
    plt.figure()
    plt.plot(epochs, mae, 'b', label='Training MAE')
    plt.plot(epochs, val_mae, 'r', label='Validation MAE')
    plt.title(title)
    plt.legend()
    plt.show()
for x, y in train_data_multi.take(1):
multi_step_plot(x[0], y[0], np.array([0]))
I am using two LSTM layers with 512 hidden units each, followed by a Dropout layer to reduce overfitting. The final Dense layer outputs 5 values: the predictions for the next five days' prices (on the normalized scale). I use an Adam optimizer with a mean squared error loss function and track mean absolute error (MAE) as a metric. Setting clipvalue=4.0 clips gradients, which guards against the exploding gradients that are a common problem in deep recurrent architectures.
To learn more about LSTMs, see https://en.wikipedia.org/wiki/Long_short-term_memory
model = tf.keras.models.Sequential([
    tf.keras.layers.LSTM(512, return_sequences=True,
                         input_shape=x_train_multi.shape[-2:], activation='relu'),
    tf.keras.layers.LSTM(512, activation='relu'),
    tf.keras.layers.Dropout(0.3),
    tf.keras.layers.Dense(5, activation='linear'),
])
model.compile(optimizer=tf.keras.optimizers.Adam(clipvalue=4.0), loss='mse', metrics=['mae'])
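To inspect the resulting architecture, you can print a summary or render it with the plot_model utility imported earlier (plot_model is optional and needs pydot and graphviz installed):
model.summary()  # layer-by-layer output shapes and parameter counts
# plot_model(model, show_shapes=True)  # optional graphical view of the network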
I define two callbacks here. The first stops training once the mean absolute error on the validation set falls below 0.06. The second connects to TensorBoard so you can visualize the model's training later on.
class myCallback(tf.keras.callbacks.Callback):
    def on_epoch_end(self, epoch, logs=None):
        logs = logs or {}
        # Stop once validation MAE is good enough
        if logs.get('val_mae', float('inf')) < 0.06:
            print('Done Training')
            self.model.stop_training = True
callbacks = myCallback()
log_dir = "logs/fit/" + datetime.datetime.now().strftime("%Y%m%d-%H%M%S")
tensorboard_callback = tf.keras.callbacks.TensorBoard(log_dir=log_dir)
history = model.fit(train_data_multi, epochs=100,
steps_per_epoch=11,
validation_data=val_data_multi,
validation_steps=2,
verbose=1, callbacks=[callbacks, tensorboard_callback])
Uncommenting the next line lets you view the training run in TensorBoard:
#%tensorboard --logdir logs/fit
plot_train_history(history, 'Multi-Step Training and Validation MAE')
As you can see below, the model had varied success: in some cases it accurately predicts the trend of the next 5 days. Note that the y-axis is on the normalized scale. It is important to realize that predicting stock prices is a notoriously difficult task; there is a lot of information, such as news headlines, that could improve the model's performance.
for x, y in val_data_multi.take(5):
multi_step_plot(x[0], y[0], model.predict(x)[0])
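Since the plots above are in normalized units, you can map a prediction back to the original price scale by inverting the z-score normalization for the target column (ma_close is column 0 of the feature matrix). A quick sketch using the data_mean and data_std computed earlier:
# Undo the z-score normalization for the target column (index 0 = ma_close)
for x, y in val_data_multi.take(1):
    pred = model.predict(x)[0]
    pred_prices = pred * data_std[0] + data_mean[0]
    true_prices = y[0].numpy() * data_std[0] + data_mean[0]
    print('Predicted:', np.round(pred_prices, 2))
    print('Actual:   ', np.round(true_prices, 2))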
Overall, this is a great example of how deep learning can tackle difficult tasks like forecasting stock prices. Perhaps with future tweaks, someone can get the model to perform even better!