Compare commits

...

4 Commits

Author SHA1 Message Date
Animus
3741919937
Merge 7397e3d2b0 into fb54d8b549 2023-05-22 17:41:07 -05:00
Noting565
7397e3d2b0 Update ScoreFacadeStore.scala
The correct article to use before the word "uniform" is "an" in American English.
2023-03-31 17:30:02 -07:00
Animus
b04e3521a2
fix another grammatical error
this fixes another grammar error with assistance from @sdornan
2023-03-31 17:21:59 -07:00
Noting565
4c5db53916 fix grammar in simclusters-ann readme.md 2023-03-31 17:12:52 -07:00
2 changed files with 2 additions and 2 deletions

View File

@ -12,7 +12,7 @@ The cosine similarity between two Tweet SimClusters Embedding presents the relev
SimClusters from the Linear Algebra Perspective discussed the difference between the dot-product and cosine similarity in SimCluster space. We believe the cosine similarity approach is better because it avoids the bias of tweet popularity.
However, calculating the cosine similarity between two Tweets is pretty expensive in Tweet candidate generation. In TWISTLY, we scan at most 15,000 (6 source tweets * 25 clusters * 100 tweets per clusters) tweet candidates for every Home Timeline request. The traditional algorithm needs to make API calls to fetch 15,000 tweet SimCluster embeddings. Consider that we need to process over 6,000 RPS, its hard to support by the existing infrastructure.
However, calculating the cosine similarity between two Tweets is pretty expensive in Tweet candidate generation. In TWISTLY, we scan at most 15,000 (6 source tweets * 25 clusters * 100 tweets per clusters) tweet candidates for every Home Timeline request. The traditional algorithm needs to make API calls to fetch 15,000 tweet SimCluster embeddings. Considering that we need to process over 6,000 RPS, its hard to support with the existing infrastructure.
## SimClusters Approximate Cosine Similarity Core Algorithm

View File

@ -10,7 +10,7 @@ import com.twitter.storehaus.ReadableStore
import com.twitter.util.Future
/**
* Provide a uniform access layer for all kind of Score.
* Provide an uniform access layer for all kind of Score.
* @param readableStores readable stores indexed by the ScoringAlgorithm they implement
*/
class ScoreFacadeStore private (