Social Graph of New York

A social graph of New York’s well-to-do, generated from the image captions of over a thousand web pages.

Note: The code and the results cannot be publicly displayed because they are under copyright by The Data Incubator; however, they can be sent privately upon request.

The page links and party dates were pulled from the New York Social Diary’s Party Picture Archives. After HTML parsing and a great deal of text cleaning, I assembled a graph of everyone appearing in any caption of any party in the archive. Each node is a person, and each edge weight is the number of times the two people it connects were named in the same photo caption.
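The edge-weighting rule can be illustrated with a minimal Python sketch. This is not the original project code, which cannot be shared. It assumes the captions have already been parsed and cleaned into lists of names; the function name `build_cooccurrence_edges` and the sample names are hypothetical.

```python
from collections import Counter
from itertools import combinations


def build_cooccurrence_edges(captions):
    """Count how often each pair of names shares a caption.

    `captions` is an iterable of name lists, one list per photo caption
    (assumed to be already extracted and cleaned).
    Returns a Counter mapping (name_a, name_b) -> shared-caption count,
    with each pair stored in sorted order so (a, b) and (b, a) coincide.
    """
    edge_weights = Counter()
    for names in captions:
        # Deduplicate so a name repeated in one caption is counted once,
        # and sort so every pair has a single canonical orientation.
        unique_names = sorted(set(names))
        for a, b in combinations(unique_names, 2):
            edge_weights[(a, b)] += 1
    return edge_weights


# Hypothetical sample data, not taken from the archive.
captions = [
    ["Alice Smith", "Bob Jones"],
    ["Alice Smith", "Bob Jones", "Carol White"],
    ["Carol White"],  # a single-name caption adds no edges
]
edges = build_cooccurrence_edges(captions)
```

The resulting counter can be passed to a graph library as a weighted edge list; a plain `Counter` keeps the sketch dependency-free.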

Learning Points: HTML parsing, string parsing, BeautifulSoup, graph manipulation


© 2023. All rights reserved.