Algorithm finds hidden connections between paintings at the Met

29 July 2020, 3:00 pm

Art is often heralded as the greatest journey into the past, solidifying a moment in time and space; the beautiful vehicle that lets us momentarily escape the present.
With the boundless treasure trove of paintings that exist, the connections between these works of art from different periods of time and space can often go overlooked. It’s impossible for even the most knowledgeable of art critics to take in millions of paintings across thousands of years and be able to find unexpected parallels in themes, motifs, and visual styles.
To streamline this process, a group of researchers from MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) and Microsoft created an algorithm to discover hidden connections between paintings at the Metropolitan Museum of Art (the Met) and Amsterdam’s Rijksmuseum.
Inspired by a special exhibit, “Rembrandt and Velázquez,” in the Rijksmuseum, the new “MosAIc” system finds paired or “analogous” works from different cultures, artists, and media by using deep networks to understand how “close” two images are. In that exhibit, the researchers were inspired by an unlikely, yet similar pairing: Francisco de Zurbarán’s “The Martyrdom of Saint Serapion” and Jan Asselijn’s “The Threatened Swan,” two works that portray scenes of profound altruism with an eerie visual resemblance.
“These two artists did not have a correspondence or meet each other during their lives, yet their paintings hinted at a rich, latent structure that underlies both of their works,” says CSAIL PhD student Mark Hamilton, the lead author on a paper about “MosAIc.”
To find two similar paintings, the team used a new algorithm for image search to unearth the closest match by a particular artist or culture. For example, in response to a query about “which musical instrument is closest to this painting of a blue-and-white dress,” the algorithm retrieves an image of a blue-and-white porcelain violin. These works are not only similar in pattern and form, but also draw their roots from a broader cultural exchange of porcelain between the Dutch and Chinese.
“Image retrieval systems let users find images that are semantically similar to a query image, serving as the backbone of reverse image search engines and many product recommendation engines,” says Hamilton. “Restricting an image retrieval system to particular subsets of images can yield new insights into relationships in the visual world. We aim to encourage a new level of engagement with creative artifacts.”
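As a rough illustration of what such a conditional query looks like in practice, the sketch below restricts a toy collection to a metadata-defined subset (say, a medium or culture) and then returns the nearest item in feature space. The records, features, and helper names are placeholders for this article, not the Met or Rijksmuseum data or the authors’ code.

```python
# Minimal sketch of conditional image retrieval: filter candidates by metadata,
# then return the nearest neighbor in feature space. All data here is a toy
# placeholder, not the actual museum collections.
import numpy as np

def conditional_query(query_feat, features, metadata, condition):
    """Return the index of the closest item whose metadata satisfies `condition`."""
    candidates = [i for i, meta in enumerate(metadata) if condition(meta)]
    if not candidates:
        return None
    dists = np.linalg.norm(features[candidates] - query_feat, axis=1)
    return candidates[int(np.argmin(dists))]

# Toy example: five items with random features and simple metadata tags.
rng = np.random.default_rng(0)
features = rng.normal(size=(5, 128))
metadata = [
    {"medium": "painting", "culture": "Dutch"},
    {"medium": "porcelain", "culture": "Chinese"},
    {"medium": "porcelain", "culture": "Dutch"},
    {"medium": "sculpture", "culture": "Egyptian"},
    {"medium": "painting", "culture": "Spanish"},
]
query = rng.normal(size=128)

# "Which porcelain object is closest to this query image?"
best = conditional_query(query, features, metadata, lambda m: m["medium"] == "porcelain")
print("Closest porcelain item:", best)
```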
How it works

For many, art and science are irreconcilable: one grounded in logic, reasoning, and proven truths, and the other motivated by emotion, aesthetics, and beauty. But recently, AI and art took on a new flirtation that, over the past 10 years, developed into something more serious.
A large branch of this work, for example, has previously focused on generating new art using AI. There was the GauGAN project developed by researchers at MIT, NVIDIA, and the University of California at Berkeley; Hamilton and others’ previous GenStudio project; and even an AI-generated artwork that sold at Sotheby’s for $51,000.
MosAIc, however, doesn’t aim to create new art so much as help explore existing art. One similar tool, Google’s “X Degrees of Separation,” finds paths of art that connect two works of art, but MosAIc differs in that it only requires a single image. Instead of finding paths, it uncovers connections in whatever culture or media the user is interested in, such as finding the shared artistic form of “Anthropoides paradisea” and “Seth Slaying a Serpent, Temple of Amun at Hibis.”
Hamilton notes that building out their algorithm was a tricky endeavor, because they wanted to find images that were similar not just in color or style, but in meaning and theme. In other words, they’d want dogs to be close to other dogs, people to be close to other people, and so forth. To achieve this, they probe a deep network’s inner “activations” for each image in the combined open access collections of the Met and the Rijksmuseum. Distance between the “activations” of this deep network, which are commonly called “features,” was how they judged image similarity.
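A minimal sketch of that idea, assuming a standard pretrained ResNet backbone (the article does not specify the exact network): each image is mapped to its penultimate-layer activations, and the distance between two such feature vectors serves as the similarity score.

```python
# Hedged sketch: deep-network activations ("features") as an image similarity
# measure. The backbone choice is an assumption for illustration only.
import torch
import torchvision.models as models
import torchvision.transforms as T
from PIL import Image

# Pretrained backbone with the classification head removed, so the output is
# the penultimate-layer activation vector.
backbone = models.resnet50(weights=models.ResNet50_Weights.DEFAULT)
feature_extractor = torch.nn.Sequential(*list(backbone.children())[:-1]).eval()

preprocess = T.Compose([
    T.Resize(256),
    T.CenterCrop(224),
    T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

@torch.no_grad()
def embed(path: str) -> torch.Tensor:
    """Map an image file to a flat feature vector."""
    img = Image.open(path).convert("RGB")
    return feature_extractor(preprocess(img).unsqueeze(0)).flatten()

def distance(path_a: str, path_b: str) -> float:
    """Euclidean distance between the two images' feature vectors."""
    return torch.dist(embed(path_a), embed(path_b)).item()

# Example usage (file paths are placeholders):
# print(distance("threatened_swan.jpg", "saint_serapion.jpg"))
```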
To find analogous images between different cultures, the team used a new image-search data structure called a “conditional KNN tree” that groups similar images together in a tree-like structure. To find a close match, they start at the tree’s “trunk” and follow the most promising “branch” until they are sure they’ve found the closest image. The data structure improves on its predecessors by allowing the tree to quickly “prune” itself to a particular culture, artist, or collection, quickly yielding answers to new types of queries.
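The toy below sketches only the pruning idea behind such a structure: each node records which labels (here, cultures) appear in its subtree, so branches that cannot contain a valid match for the query’s condition are skipped, along with branches that cannot beat the best distance found so far. It is an illustrative ball-tree-style stand-in, not the authors’ conditional KNN tree implementation.

```python
# Toy "conditional" nearest-neighbor tree: prune branches whose subtree lacks
# the requested label, and branches whose bounding ball cannot improve on the
# current best match. Illustrative only.
import numpy as np

class Node:
    def __init__(self, indices, features, labels, leaf_size=8):
        self.indices = indices
        self.labels_present = {labels[i] for i in indices}   # used for pruning
        self.center = features[indices].mean(axis=0)
        self.radius = np.max(np.linalg.norm(features[indices] - self.center, axis=1))
        self.left = self.right = None
        if len(indices) > leaf_size:
            # Crude split by distance to the node center (sufficient for a toy).
            d = np.linalg.norm(features[indices] - self.center, axis=1)
            order = np.argsort(d)
            half = len(indices) // 2
            self.left = Node([indices[i] for i in order[:half]], features, labels, leaf_size)
            self.right = Node([indices[i] for i in order[half:]], features, labels, leaf_size)

def search(node, query, label, features, labels, best=(np.inf, None)):
    """Best (distance, index) with the given label; prunes impossible branches."""
    if node is None or label not in node.labels_present:
        return best
    # Bound check: skip the branch if it cannot beat the current best distance.
    if np.linalg.norm(query - node.center) - node.radius > best[0]:
        return best
    if node.left is None:                      # leaf: scan its items
        for i in node.indices:
            if labels[i] == label:
                d = np.linalg.norm(query - features[i])
                if d < best[0]:
                    best = (d, i)
        return best
    best = search(node.left, query, label, features, labels, best)
    return search(node.right, query, label, features, labels, best)

# Toy usage with random features and two "cultures".
rng = np.random.default_rng(1)
feats = rng.normal(size=(100, 32))
labs = ["Dutch" if i % 2 else "Chinese" for i in range(100)]
root = Node(list(range(100)), feats, labs)
print(search(root, rng.normal(size=32), "Dutch", feats, labs))
```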
What Hamilton and his colleagues found surprising was that this approach could also be applied to helping find problems with existing deep networks, related to the surge of “deepfakes” that have recently cropped up. They applied this data structure to find areas where probabilistic models, such as the generative adversarial networks (GANs) that are often used to create deepfakes, break down. They coined these problematic areas “blind spots,” and note that they give us insight into how GANs can be biased. Such blind spots further show that GANs struggle to represent particular areas of a dataset, even if most of their fakes can fool a human.
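One hedged way to picture this use of nearest-neighbor search: real examples whose closest generated sample sits unusually far away in feature space are candidates for regions the GAN fails to cover. The thresholding and feature choice below are illustrative assumptions, not the paper’s exact procedure.

```python
# Hedged sketch of surfacing GAN "blind spot" candidates via nearest-neighbor
# distances between real and generated features. Toy data and threshold.
import numpy as np

def blind_spot_candidates(real_feats, fake_feats, quantile=0.95):
    """Indices of real items whose nearest fake is farther than the given quantile."""
    # Pairwise distances between every real feature and every fake feature.
    dists = np.linalg.norm(real_feats[:, None, :] - fake_feats[None, :, :], axis=-1)
    nearest_fake = dists.min(axis=1)          # distance to closest fake, per real item
    cutoff = np.quantile(nearest_fake, quantile)
    return np.where(nearest_fake > cutoff)[0]

# Toy data: the "fakes" only cover the first of two real modes.
rng = np.random.default_rng(2)
real = np.vstack([rng.normal(0, 1, (50, 16)), rng.normal(5, 1, (50, 16))])
fake = rng.normal(0, 1, (100, 16))            # misses the second mode entirely
print(blind_spot_candidates(real, fake))      # flagged indices fall in the uncovered mode (50-99)
```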
Testing MosAIc

The team evaluated MosAIc’s speed, and how closely it aligned with our human intuition about visual analogies.
For the speed tests, they wanted to make sure that their data structure provided value over simply searching through the collection with a brute-force search.
To understand how well the system aligned with human intuitions, they made and released two new datasets for evaluating conditional image retrieval systems. One dataset challenged algorithms to find images with the same content even after they had been “stylized” with a neural style transfer method. The second dataset challenged algorithms to recover English letters across different fonts. A bit less than two-thirds of the time, MosAIc was able to recover the correct image in a single guess from a “haystack” of 5,000 images.
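The “single guess from a haystack” figure corresponds to a recall@1-style measurement, sketched below on toy data; the released benchmark datasets and the exact evaluation protocol are described in the paper.

```python
# Sketch of recall@1: credit is given only when the top-ranked retrieval is the
# one correct counterpart among all distractors. Toy features, for illustration.
import numpy as np

def recall_at_1(query_feats, gallery_feats, correct_idx):
    """Fraction of queries whose nearest gallery item is the correct one."""
    hits = 0
    for q, target in zip(query_feats, correct_idx):
        dists = np.linalg.norm(gallery_feats - q, axis=1)
        hits += int(np.argmin(dists) == target)
    return hits / len(query_feats)

# Toy example: 10 queries against a gallery of 5,000 items, where query i's
# correct match is gallery item i (features here are random placeholders).
rng = np.random.default_rng(3)
gallery = rng.normal(size=(5000, 64))
queries = gallery[:10] + 0.05 * rng.normal(size=(10, 64))   # noisy copies
print(recall_at_1(queries, gallery, list(range(10))))       # close to 1.0 on this toy
```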
“Going forward, we hope this work inspires others to think about how tools from information retrieval can help other fields like the arts, humanities, social science, and medicine,” says Hamilton. “These fields are rich with information that has never been processed with these techniques and can be a source for great inspiration for both computer scientists and domain experts. This work can be expanded in terms of new datasets, new types of queries, and new ways to understand the connections between works.”
Hamilton wrote the paper on MosAIc alongside Professor Bill Freeman and MIT undergraduates Stefanie Fu and Mindren Lu. The MosAIc website was built by Fu, Lu, Zhenbang Chen, Felix Tran, Darius Bopp, Margaret Wang, Marina Rogers, and Johnny Bui at the Microsoft Garage winter externship program.
Source: MIT News, Computer Science and Artificial Intelligence Laboratory (CSAIL). Reprinted with permission of MIT News.