Sorts a dfm by descending frequency of total features, total features in documents, or both.
dfm_sort(x, decreasing = TRUE, margin = c("features", "documents", "both"))
x | Document-feature matrix created by |
---|---|
decreasing | logical; if |
margin | which margin to sort on |
A sorted dfm matrix object
Ken Benoit
dfmat <- dfm(data_corpus_inaugural) #> Warning: 'dfm.corpus()' is deprecated. Use 'tokens()' first. head(dfmat) #> Document-feature matrix of: 6 documents, 9,439 features (93.84% sparse) and 4 docvars. #> features #> docs fellow-citizens of the senate and house representatives : #> 1789-Washington 1 71 116 1 48 2 2 1 #> 1793-Washington 0 11 13 0 2 0 0 1 #> 1797-Adams 3 140 163 1 130 0 2 0 #> 1801-Jefferson 2 104 130 0 81 0 0 1 #> 1805-Jefferson 0 101 143 0 93 0 0 0 #> 1809-Madison 1 69 104 0 43 0 0 0 #> features #> docs among vicissitudes #> 1789-Washington 1 1 #> 1793-Washington 0 0 #> 1797-Adams 4 0 #> 1801-Jefferson 1 0 #> 1805-Jefferson 7 0 #> 1809-Madison 0 0 #> [ reached max_nfeat ... 9,429 more features ] head(dfm_sort(dfmat)) #> Document-feature matrix of: 6 documents, 9,439 features (93.84% sparse) and 4 docvars. #> features #> docs the of , and . to in a our we #> 1789-Washington 116 71 70 48 23 48 31 14 1 1 #> 1793-Washington 13 11 5 2 4 5 3 0 0 0 #> 1797-Adams 163 140 201 130 33 72 47 51 6 3 #> 1801-Jefferson 130 104 128 81 37 61 24 21 24 10 #> 1805-Jefferson 143 101 142 93 41 83 35 20 24 13 #> 1809-Madison 104 69 47 43 21 61 34 19 9 2 #> [ reached max_nfeat ... 9,429 more features ] head(dfm_sort(dfmat, decreasing = FALSE, "both")) #> Document-feature matrix of: 6 documents, 9,439 features (96.33% sparse) and 4 docvars. #> features #> docs notification 14th fondest predilection flattering asylum #> 1793-Washington 0 0 0 0 0 0 #> 1945-Roosevelt 0 0 0 0 0 0 #> 1865-Lincoln 0 0 0 0 0 0 #> 1905-Roosevelt 0 0 0 0 0 0 #> 1849-Taylor 0 0 0 0 0 0 #> 1829-Jackson 0 0 0 0 0 0 #> features #> docs interruptions awaken distrustful despondence #> 1793-Washington 0 0 0 0 #> 1945-Roosevelt 0 0 0 0 #> 1865-Lincoln 0 0 0 0 #> 1905-Roosevelt 0 0 0 0 #> 1849-Taylor 0 0 0 0 #> 1829-Jackson 0 0 0 0 #> [ reached max_nfeat ... 9,429 more features ]