diff --git a/docs/_static/img/image_spec.png b/docs/_static/img/image_spec.png
new file mode 100644
index 0000000..0e3de13
Binary files /dev/null and b/docs/_static/img/image_spec.png differ
diff --git a/docs/components/spec.rst b/docs/components/spec.rst
index ed07c7a..6d3b6eb 100644
--- a/docs/components/spec.rst
+++ b/docs/components/spec.rst
@@ -80,6 +80,51 @@ Table Specification
 Image Specification
 --------------------------
 
+Image data lives in a higher dimensional space than other data types. Unlike lower dimensional spaces, metrics defined based on Euclidean distances (or similar distances) will fail in higher dimensional spaces. This means that measuring the similarity between image samples becomes difficult. 
+
+To address these issues, we use the Neural Tangent Kernel (NTK) based on Convolutional Neural Networks (CNN) to measure the similarity of image samples.  As we all know, CNN has greatly advanced the field of computer vision and is still a mainstream deep learning technique. 
+
+Usage & Example
+^^^^^^^^^^^^^^^^^^^^^^^^^^
+
+In this part, we show that how to generate Image Specification for the training set of the CIFAR-10 dataset. 
+Note that the Image Specification is generated on a subset of the CIFAR-10 dataset with ``generate_rkme_image_spec``. 
+Then, it is saved to file "cifar10.json" using ``spec.save``. 
+
+In many cases, it is difficult to construct Image Specification on the full dataset. 
+By randomly sampling a subset of the dataset, we can construct Image Specification based on it efficiently, with a strong enough statistical description of the full dataset.
+
+.. tip::
+   Typically, sampling 3,000 to 10,000 images is sufficient to generate the Image Specification.
+
+.. code-block:: python
+
+   import torchvision
+   from torch.utils.data import DataLoader
+   from learnware.specification import generate_rkme_image_spec
+
+   SAMPLED_SIZE = 5000
+
+   full_set = torchvision.datasets.CIFAR10(
+      root='./data', train=True, download=True, transform=torchvision.transforms.ToTensor())
+   loader =  DataLoader(full_set, batch_size=SAMPLED_SIZE, shuffle=True)
+   sampled_X, _ = next(iter(loader))
+
+   spec = generate_rkme_image_spec(sampled_X)
+   spec.save("cifar10.json")
+
+Privacy Protection
+^^^^^^^^^^^^^^^^^^^^^^^^^^
+
+In the third row of the figure, we show the eight pseudo-data with the largest weights :math:`\beta` in the Image Specification generated on the CIFAR-10 dataset.
+Notice that the Image Specification generated based on Neural Tangent Kernel (NTK) protects the user's privacy very well.
+
+In contrast, we show the performance of the RBF kernel on image dat in the first row of the figure below. 
+The RBF not only exposes the real data (plotted in the corresponding position in the second row), but also fails to fully utilise the weights :math:`\beta`.
+
+.. image:: ../_static/img/image_spec.png
+   :align: center
+
 Text Specification
 --------------------------