-
Thoughts on Web Search & Data Mining Conferences
Recently finished reviewing papers for the Web Search and Data Mining Conference (WSDM) 2023 as a part of the program committee. WSDM is a premier conference in search and data mining on the web and social web. Lots of interesting papers are going to be published soon. I was also a part of the program committee for WSDM […]
-
Influence of First Connections for a New Employee on Growth and Retention
Social network matter in engaging a new member in a community. Are there patterns in initial connections in the new company that influence future retention in the company? How does a new employee in a company network or connect with other employees of the company? Is there any similarity in the company network of a […]
-
Reinventing People You May Know at LinkedIn
People You May Know (PYMK) recommends other people to connect with allowing members to grow their network, and it’s one of the most recognizable feature at LinkedIn. PYMK is responsible for building more than 50% of LinkedIn’s professional graph. The two main challenges in building People You May Know are machine learning and scale. In […]
-
Growth Diffusion at LinkedIn via Cascading Invitations
Figure 1: Example LinkedIn signup cascade Many of the popular websites such as LinkedIn power their growth through guest invitations from existing members to non-members. New members joining can also send such guest invitations resulting in cascade of membership growth at a large scale. How does such cascade of membership growth looks like? How viral […]
-
Organizational Overlap on Social Networks and its Applications
Online social networks have become important tools for networking, communication, sharing, and discovery. A considerable challenge these networks face is the fact that an online social network is partially observed: two individuals might know each other, but may not have established a connection on the site. Therefore, link prediction and recommendations are important tasks for […]
-
Beyond MapReduce
Google announced that they are not using MapReduce anymore: “Google dumps MapReduce favor new hyper scale analytics system”. MapReduce has been a simple abstraction that has made large scale data processing easier, scalable, and fault-tolerant. However, MapReduce paradigm does not work well for many use cases such as stream processing, iterative computation, graph processing, real-time […]
-
Summary of a few papers from SIGIR 2012 – Part I
(photo from: http://www.city-data.com/picfilesv/picv32970.php) Here is a short summary of a few papers from SIGIR 2012: Adaptation of the Concept Hierarchy Model with Search Logs for Query Recommendation on Intranets by Ibrahim Adeyanju, Dawei Song, M-Dyaa Albakour, Udo Kruschwitz, Anne De Roeck and Maria Fasli. This paper talks about enhancing query suggestions on Intranets. The paper […]
-
Related Searches at LinkedIn
Search plays an important role in online social networks such as LinkedIn as it provides an essential mechanism for discovering members and content on the network. Related search recommendation is one way for improving members’ search experience in finding relevant results. We recently wrote a paper on Metaphor, the related search recommendation system at LinkedIn […]
-
On leap second bug
Last Saturday “leap second” adjustment caused issues with many online sites: “Leap second bug wreaks havoc across web”. Google’s SRE team posted a nice blog post on how they fixed leap second issue: “Time, technology and leaping seconds” by “leap smear”, where they change duration of each second reported by NTP depending on “leap second” is […]
-
igraph: a nice graph visualization package in R
Trying out igraph, a graph visualization package in R. Looks promising.