As I am still a beginner when it comes to machine learning, I am looking to broaden my knowledge and try something that will challenge me. Through my past projects, I have become more interested in text generation and word vectors, and I am building on that knowledge for my final project. I am therefore going to train a text generation model and create a web application called “AI Thoughts.” Ideally, I would like to create an interface where users can chat with the model I’ve trained and ask it questions.
To do this, I am going to use the ML5 text generator model and train it on data which I have found (more information can be found here). Previously, I tested an example made by Keras (link here), but the results were not good. I could use a model provided by TensorFlow, but I am unsure how to connect such a model to JavaScript. I could probably figure this out, but I would rather test the ML5 model first, as it is the most streamlined option. The ML5 model also uses LSTM (long short-term memory) layers, a type of recurrent layer that helps RNNs retain context when working with text. In terms of datasets, I found one based on New York Times articles, and it has been shown to give interesting results when paired with text generation. Along with this, I have found others based on user reviews and books. If the dataset I use turns out not to be large enough, I will look for a larger one that gives better results.
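To get a rough sense of whether a dataset is large enough, it can help to see how a character-level LSTM typically consumes text: the corpus is cut into overlapping windows, and the model learns to predict the character that follows each window. The sketch below is only an illustration of that idea and is not the ML5 or Keras training code. The function names and the default window length (40) and step (3) are assumptions chosen for the example.

```python
def make_training_pairs(text, seq_len=40, step=3):
    """Slice a corpus into (input window, next character) pairs.

    seq_len and step are illustrative defaults, not values taken
    from any particular library.
    """
    pairs = []
    # Stop early enough that every window has a following character.
    for start in range(0, len(text) - seq_len, step):
        window = text[start:start + seq_len]
        next_char = text[start + seq_len]
        pairs.append((window, next_char))
    return pairs


def summarize_corpus(text, seq_len=40, step=3):
    """Report basic size statistics for a candidate training corpus."""
    vocab = sorted(set(text))  # distinct characters the model must learn
    pairs = make_training_pairs(text, seq_len, step)
    return {
        "characters": len(text),
        "vocab_size": len(vocab),
        "training_pairs": len(pairs),
    }


if __name__ == "__main__":
    # Placeholder text standing in for a real dataset file.
    sample = "the quick brown fox jumps over the lazy dog. " * 4
    print(summarize_corpus(sample, seq_len=10, step=3))
```

Running `summarize_corpus` on each candidate dataset gives a quick way to compare how many training examples each one would yield before committing to a long training run.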
The purpose of this project is mainly to give me a better understanding of machine learning and its implementations. I expect it to be very exploratory and to leave plenty of room for learning new things. My interest in word vectors and text generation comes from the idea that language is essential to humans, and I find it interesting to see how a machine processes it. Both also have many practical uses: word vectors are being used for translation, and text generation technology is being used in many different areas. As I become more acquainted with machine learning techniques, I hope to build upon my previous work and form something new and more advanced. But for now, I am still learning.