How can Tensorflow be used to find the state of preprocessing layer in dataset using Python?


Tensorflow is a machine learning framework that is provided by Google. It is an open-source framework used in conjunction with Python to implement algorithms, deep learning applications, and much more. It is used in research and for production purposes. It has optimization techniques that help in performing complicated mathematical operations quickly.

This is because it uses NumPy and multi-dimensional arrays. These multi-dimensional arrays are also known as ‘tensors’. The framework supports working with a deep neural network. It is highly scalable and comes with many popular datasets. It uses GPU computation and automates the management of resources. It comes with multitude of machine learning libraries and is well-supported and documented. The framework has the ability to run deep neural network models, train them, and create applications that predict relevant characteristics of the respective datasets.

The ‘tensorflow’ package can be installed on Windows using the below line of code −

pip install tensorflow

Tensor is a data structure used in TensorFlow. It helps connect edges in a flow diagram. This flow diagram is known as the ‘Data flow graph’. Tensors are nothing but a multidimensional array or a list.

We are using Google Colaboratory to run the below code. Google Colab or Colaboratory helps run Python code over the browser and requires zero configuration and free access to GPUs (Graphical Processing Units). Colaboratory has been built on top of Jupyter Notebook. Following is the code snippet −

Example

print("A text-only dataset without labels is prepared")
train_text = raw_train_ds.map(lambda text, labels: text)
print("The adapt method is called")
binary_vectorize_layer.adapt(train_text)
int_vectorize_layer.adapt(train_text)
print("The result is displayed on the console")
def binary_vectorize_text(text, label):
   text = tf.expand_dims(text, -1)
   return binary_vectorize_layer(text), label

Code credit − https://www.tensorflow.org/tutorials/load_data/text

Output

A text-only dataset without labels is prepared
The adapt method is called
The result is displayed on the console

Explanation

  • The dataset without using the labels is prepared.

  • A method named ‘adapt’ is called on this data.

  • This will vectorize the dataset using the ‘binary’ format of the model.

Updated on: 18-Jan-2021

56 Views

Kickstart Your Career

Get certified by completing the course

Get Started
Advertisements