Repository logo
 
Publication

Learning attribute and homophily measures through random walks

dc.contributor.authorAntunes, Nelson
dc.contributor.authorBanerjee, Sayan
dc.contributor.authorBhamidi, Shankar
dc.contributor.authorPipiras, Vladas
dc.date.accessioned2023-07-26T11:13:18Z
dc.date.available2023-07-26T11:13:18Z
dc.date.issued2023-06-27
dc.date.updated2023-07-01T03:28:49Z
dc.description.abstractWe investigate the statistical learning of nodal attribute functionals in homophily networks using random walks. Attributes can be discrete or continuous. A generalization of various existing canonical models, based on preferential attachment is studied (model class P), where new nodes form connections dependent on both their attribute values and popularity as measured by degree. An associated model class U is described, which is amenable to theoretical analysis and gives access to asymptotics of a host of functionals of interest. Settings where asymptotics for model class U transfer over to model class P through the phenomenon of resolvability are analyzed. For the statistical learning, we consider several canonical attribute agnostic sampling schemes such as Metropolis-Hasting random walk, versions of node2vec (Grover and Leskovec, 2016) that incorporate both classical random walk and non-backtracking propensities and propose new variants which use attribute information in addition to topological information to explore the network. Estimators for learning the attribute distribution, degree distribution for an attribute type and homophily measures are proposed. The performance of such statistical learning framework is studied on both synthetic networks (model class P) and real world systems, and its dependence on the network topology, degree of homophily or absence thereof, (un)balanced attributes, is assessed.pt_PT
dc.description.sponsorshipS. Banerjee is partially supported by the NSF CAREER award DMS-2141621. S. Bhamidi and V. Pipiras are partially supported by NSF DMS-2113662. S. Banerjee, S. Bhamidi and V.Pipiras are partially supported by NSF RTG grant DMS-2134107pt_PT
dc.description.versioninfo:eu-repo/semantics/publishedVersionpt_PT
dc.identifier.citationApplied Network Science. 2023 Jun 27;8(1):39pt_PT
dc.identifier.doi10.1007/s41109-023-00558-3pt_PT
dc.identifier.urihttp://hdl.handle.net/10400.1/19871
dc.language.isoengpt_PT
dc.language.rfc3066en
dc.peerreviewedyespt_PT
dc.publisherSpringer Openpt_PT
dc.rights.holderThe Author(s)
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/pt_PT
dc.subjectAttributed networkspt_PT
dc.subjectHomophilypt_PT
dc.subjectNetwork modelpt_PT
dc.subjectResolvabilitypt_PT
dc.subjectRandom walk samplingspt_PT
dc.subjectDiscrete and continuous attributespt_PT
dc.subjectLearning attribute functionalspt_PT
dc.titleLearning attribute and homophily measures through random walkspt_PT
dc.typejournal article
dspace.entity.typePublication
oaire.citation.issue1pt_PT
oaire.citation.startPage39pt_PT
oaire.citation.titleApplied Network Sciencept_PT
oaire.citation.volume8pt_PT
person.familyNameAntunes
person.givenNameNelson
person.identifier.ciencia-idA31E-F40A-C819
person.identifier.orcid0000-0001-6071-1099
person.identifier.scopus-author-id15063869700
rcaap.rightsopenAccesspt_PT
rcaap.typearticlept_PT
relation.isAuthorOfPublicationaea16f78-689b-426f-b6ad-e76e7b3972fe
relation.isAuthorOfPublication.latestForDiscoveryaea16f78-689b-426f-b6ad-e76e7b3972fe

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
41109_2023_Article_558.pdf
Size:
3.88 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
3.46 KB
Format:
Item-specific license agreed upon to submission
Description: