大理大学学报 ›› 2026, Vol. 11 ›› Issue (6): 39-45.

• 数学与计算机科学 • 上一篇    下一篇

应用型高职院校大学生创新创业项目数据分类方法

  

  1. (安徽工业经济职业技术学院,合肥 230051)
  • 收稿日期:2025-04-18 出版日期:2026-06-15 发布日期:2026-06-30
  • 作者简介:黄雪飞,副教授,主要从事大学生职业生涯规划、就业与创业指导、大学生心理健康教育、高校学生管理研究。
  • 基金资助:
    安徽省高校科研项目(2024SK02);安徽省职成教教研项目(AZCJ2023093);安徽省教育厅高校哲学社会科学研究项目(2023AH052668)

Data Classification Method for College Students' Innovation and Entrepreneurship Projects in Applied Higher Vocational Colleges

  1. (Anhui Technical College of Industrial and Economy, Hefei 230051,China)
  • Received:2025-04-18 Online:2026-06-15 Published:2026-06-30

摘要: 数据分类是应用型高职院校大学生创新创业项目数据管理工作中的重要内容,但由于数据不平衡性,分类器可能存在泛化能力不足的问题,导致分类结果不够准确。为解决这一问题,本研究提出一种创新创业项目数据分类方法。首先将项目数据库中的文本数据转化为向量形式,建立基于支持向量机的机器学习分类器;然后采用自适应合成抽样的方法对该分类器进行过采样,解决数据不平衡问题,以优化分类器的泛化能力;最后利用分类器对转化后的项目数据进行分类,并采用Kmeans算法进行聚类集成,实现项目数据的分类处理。实验结果表明,该方法对于创新创业数据的分类较为准确,平均精度均值达到0.986,F1分数达到0.973,分类性能优良,具有良好的实践应用前景。

关键词: 数据分类, 大学生创新创业项目, 应用型, 项目数据, 高职院校

Abstract: Data classification is a crucial component in the management of innovation and entrepreneurship project data for vocational college students. However, due to data imbalance, classifiers may suffer from insufficient generalization capabilities, leading to less accurate classification results. To address this issue, this study proposes a data classification method for innovation and entrepreneurship proj⁃
ects. First, textual data from the project database is converted into vector form, and a machine learning classifier based on support vector machines is established. Then, an adaptive synthetic sampling method is employed to oversample the classifier, mitigating data imbalance and optimizing its generalization capabilities. Finally, the transformed project data is classified using the classifier, and the K-means algorithm is applied for clustering integration to achieve project data classification. Experimental results demonstrate that this method achieves high accuracy in classifying innovation and entrepreneurship data, with a mean average precision of 0.986 and an F1 score of 0.973, exhibiting excellent classification performance and promising practical application prospects.

Key words: data classification, college student's innovation and entrepreneurship projects, applied type, project data, higher voca?
tional colleges

中图分类号: