如何处理 Weka 中的大量功能?
How to handle large set of features in Weka?
我想使用 Weka 为 2000 个文档生成文档术语矩阵。由于数据集中有超过 30000 个可用特征,它给出了以下错误。
org.rosuda.REngine.REngineException: Parse error
org.rosuda.REngine.REngineException: Parse error
org.rosuda.REngine.JRI.JRIEngine.parse(JRIEngine.java:262)
org.rosuda.REngine.REngine.parseAndEval(REngine.java:101)
org.rosuda.REngine.REngine.parseAndEval(REngine.java:108)
weka.core.RSessionImpl.parseAndEval(RSessionImpl.java:699)
weka.core.RSession.parseAndEval(RSession.java:361)
weka.core.RUtils.instancesToDataFrame(RUtils.java:115)
weka.gui.beans.JavaGDConsolePanel.pushInstancesToR(JavaGDConsolePanel.ja
va:771)
sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)
java.lang.reflect.Method.invoke(Unknown Source)
weka.gui.explorer.ExplorerRConsolePanel.setInstances(ExplorerRConsolePan
el.java:133)
weka.gui.explorer.Explorer.propertyChange(Explorer.java:261)
java.beans.PropertyChangeSupport.fire(Unknown Source)
java.beans.PropertyChangeSupport.firePropertyChange(Unknown Source)
java.beans.PropertyChangeSupport.firePropertyChange(Unknown Source)
weka.gui.explorer.PreprocessPanel.run(PreprocessPanel.java:535)
java.awt.event.InvocationEvent.dispatch(Unknown Source)
java.awt.EventQueue.dispatchEventImpl(Unknown Source)
java.awt.EventQueue.access0(Unknown Source)
java.awt.EventQueue.run(Unknown Source)
java.awt.EventQueue.run(Unknown Source)
java.security.AccessController.doPrivileged(Native Method)
java.security.ProtectionDomain.doIntersectionPrivilege(Unknown
Source)
java.awt.EventQueue.dispatchEvent(Unknown Source)
java.awt.EventDispatchThread.pumpOneEventForFilters(Unknown Source)
java.awt.EventDispatchThread.pumpEventsForFilter(Unknown Source)
java.awt.EventDispatchThread.pumpEventsForHierarchy(Unknown Source)
... ...
Selected Use an artificial time index Data too large
to calculate PCA. Defaulting to a random projection.
如何解决?
您的异常是由于 WEKA 的 RPlugin 包。如果您不需要从 WEKA 访问 R,请卸载该包以避免此异常。
我想使用 Weka 为 2000 个文档生成文档术语矩阵。由于数据集中有超过 30000 个可用特征,它给出了以下错误。
org.rosuda.REngine.REngineException: Parse error org.rosuda.REngine.REngineException: Parse error org.rosuda.REngine.JRI.JRIEngine.parse(JRIEngine.java:262) org.rosuda.REngine.REngine.parseAndEval(REngine.java:101) org.rosuda.REngine.REngine.parseAndEval(REngine.java:108) weka.core.RSessionImpl.parseAndEval(RSessionImpl.java:699) weka.core.RSession.parseAndEval(RSession.java:361) weka.core.RUtils.instancesToDataFrame(RUtils.java:115) weka.gui.beans.JavaGDConsolePanel.pushInstancesToR(JavaGDConsolePanel.ja va:771) sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source) sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source) java.lang.reflect.Method.invoke(Unknown Source) weka.gui.explorer.ExplorerRConsolePanel.setInstances(ExplorerRConsolePan el.java:133) weka.gui.explorer.Explorer.propertyChange(Explorer.java:261) java.beans.PropertyChangeSupport.fire(Unknown Source) java.beans.PropertyChangeSupport.firePropertyChange(Unknown Source) java.beans.PropertyChangeSupport.firePropertyChange(Unknown Source) weka.gui.explorer.PreprocessPanel.run(PreprocessPanel.java:535) java.awt.event.InvocationEvent.dispatch(Unknown Source) java.awt.EventQueue.dispatchEventImpl(Unknown Source) java.awt.EventQueue.access0(Unknown Source) java.awt.EventQueue.run(Unknown Source) java.awt.EventQueue.run(Unknown Source) java.security.AccessController.doPrivileged(Native Method) java.security.ProtectionDomain.doIntersectionPrivilege(Unknown Source) java.awt.EventQueue.dispatchEvent(Unknown Source) java.awt.EventDispatchThread.pumpOneEventForFilters(Unknown Source) java.awt.EventDispatchThread.pumpEventsForFilter(Unknown Source) java.awt.EventDispatchThread.pumpEventsForHierarchy(Unknown Source)
... ...
Selected Use an artificial time index Data too large to calculate PCA. Defaulting to a random projection.
如何解决?
您的异常是由于 WEKA 的 RPlugin 包。如果您不需要从 WEKA 访问 R,请卸载该包以避免此异常。