Homophily is the seemingly ubiquitous tendency for people to connect and interact with other individuals who are similar to them. This is a well-documented principle and is fundamental for how society organizes. Although many social interactions occur in groups, homophily has traditionally been measured using a graph model, which only accounts for pairwise interactions involving two individuals. Here, we develop a framework using hypergraphs to quantify homophily from group interactions. This reveals natural patterns of group homophily that appear with gender in scientific collaboration and political affiliation in legislative bill co-sponsorship, and also reveals distinctive gender distributions in group photographs, all of which cannot be fully captured by pairwise measures. At the same time, we show that seemingly natural ways to define group homophily are combinatorially impossible. This reveals important pitfalls to avoid when defining and interpreting notions of group homophily, as higher-order homophily patterns are governed by combinatorial constraints that are independent of human behavior but are easily overlooked.
翻译:同性恋是人们与与之相似的其他个人进行联系和互动的看似无处不在的倾向。 这是一个有充分记录的原则,是社会组织方式的根本。 虽然许多社会互动发生在群体中,但通常使用图形模型来衡量同质性互动,该模型只说明涉及两个个人的双向互动。在这里,我们开发了一个框架,用高压图来从群体互动中量化同质性。这揭示了在科学协作和立法法案共同赞助者的政治派别中出现与性别相同的群体自然模式,还揭示了群体照片中独特的性别分布,所有这些都无法通过对称措施完全捕捉到。 同时,我们展示了以貌似自然的方式来界定同质性群体是不可能的。这揭示了在界定和解释同质性群体概念时避免的重要陷阱,因为较高等级的同性模式受独立于人类行为但容易被忽视的组合制约。