Sklearn PCA is pca.components_ the loadings? I am pretty sure it is, but I am trying to follow along a research paper and I am getting different results from their loadings. I can't find it within the sklearn documentation.
相关问题
- how to define constructor for Python's new Nam
- streaming md5sum of contents of a large remote tar
- How to get the background from multiple images by
- Evil ctypes hack in python
- Correctly parse PDF paragraphs with Python
pca.components_
is the orthogonal basis of the space your projecting the data into. It has shape(n_components, n_features)
. If you want to keep the only the first 3 components (for instance to do a 3D scatter plot) of a datasets with 100 samples and 50 dimensions (also named features),pca.components_
will have shape(3, 50)
.I think what you call the "loadings" is the result of the projection for each sample into the vector space spanned by the components. Those can be obtained by calling
pca.transform(X_train)
after callingpca.fit(X_train)
. The result will have shape(n_samples, n_components)
, that is(100, 3)
for our previous example.This previous answer is mostly correct except about the loadings. components_ is in fact the loadings, as the question asker originally stated. The result of the fit_transform function will give you the principal components (the transformed/reduced matrix).