OW2 Open Source Community Competition Topics

Posted by: anonymous user, 2015-12-31 00:00:00

Topic 1: ossean-webmagic: A Distributed Vertical Data Acquisition Framework for Open Source Communities

With Webmagic as its core and built on two components, a site crawler and a data extractor, ossean-webmagic implements distributed crawling of site pages, extraction of attribute information, and monitoring of data acquisition status for open source communities.

ossean-webmagic should have the following features: (1) simple and efficient: adding data acquisition for a new target site requires only simple configuration; (2) highly extensible: it supports distributed data crawling and storage, and new nodes can be added quickly and easily; (3) highly fault tolerant: multiple validation mechanisms at the crawling and extraction stages ensure the completeness and accuracy of the data.
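As a rough illustration of the "simple configuration" and "validation" features, the sketch below drives extraction from a per-site configuration table and rejects records that lack required fields. It is a minimal sketch under stated assumptions, not the ossean-webmagic design: it does not use the Webmagic API, and the site name `example-forge`, the field patterns, and the `extract` helper are all hypothetical.

```python
import re

# Hypothetical per-site configuration: adding a new target site means adding
# one entry here, with no new crawler code. The site name and patterns are
# illustrative assumptions, not part of ossean-webmagic.
SITE_CONFIGS = {
    "example-forge": {
        "fields": {
            "name": r'<h1 class="project">(.*?)</h1>',
            "license": r'<span class="license">(.*?)</span>',
        },
        "required": ["name"],
    },
}


def extract(site, html):
    """Apply the site's configured patterns, then validate required fields."""
    config = SITE_CONFIGS[site]
    record = {}
    for field, pattern in config["fields"].items():
        match = re.search(pattern, html, re.DOTALL)
        record[field] = match.group(1).strip() if match else None
    # Validation step: refuse to store an incomplete record.
    missing = [f for f in config["required"] if not record.get(f)]
    if missing:
        raise ValueError(f"incomplete record, missing: {missing}")
    return record


SAMPLE_PAGE = '<h1 class="project">demo</h1><span class="license">MIT</span>'
print(extract("example-forge", SAMPLE_PAGE))
```

In a distributed setup, each crawler node could load the same configuration table, so supporting a new site would not require redeploying code; that is one possible reading of the requirement, not a prescribed architecture.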

 

Topic 2: An Open Source Software Recommendation System Based on Requirement Analysis

With the rapid development of the open source software (OSS) world, more and more software is built on mature open source components. Instagram, for example, uses Django, Linux, Redis, PostgreSQL, and Apache Solr, among others.

Given the project description and other information that a user supplies when creating a new project on a project hosting platform, the recommendation system mines the project's potential requirements and recommends high-quality open source software accordingly. The goal is to help developers build a system prototype quickly and speed up development. Such prototypes in turn prompt developers and users to raise new requirements, so that software requirements and recommendations evolve together through continued interaction.

 

Topic 3: Trend Analysis Tool for Open Source Software

Open source software has become an important pillar of the software industry, so understanding open source software and its development trends can give the industry a substantial boost. This study intends to analyze the state of open source software along two dimensions: community popularity and market demand. It will rank open source software on each dimension, showing how active each project is in the software community and how much demand there is for it in the job market. The end goal is to recommend usable, trustworthy open source software to developers, and to predict for job seekers which open source software is the most competitive in the market and most worth learning.

Topic 4: Real-time and distributed publication/subscription support in robot infrastructure software

Keywords: Real-time, publication/subscription, robot, infrastructure software

Description: Robotics is an emerging technology that could directly affect our daily lives in the near future. Since a robot is a typical cyber-physical system, the real-time property plays a significant role in its software construction. In a distributed environment (such as a group of collaborating robots), achieving the real-time property requires support from the infrastructure software, such as a real-time cross-node message pub/sub capability. However, mainstream open-source robot infrastructure software, such as ROS (Robot Operating System), does not yet provide this capability by design. The aim of this topic is to realize an open source distributed robot software infrastructure that supports real-time message publication/subscription across multiple robot nodes. This infrastructure should build on existing open-source work in both fields: robot infrastructure software and real-time message pub/sub middleware.
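To make the pub/sub requirement concrete, here is a minimal sketch of a topic broker that attaches a delivery deadline to each subscription and counts messages that arrive too late. It is an illustration only: it runs in a single process, Python offers no real-time guarantees, and the `Broker` class, the `odometry` topic, and the `sent_at` parameter (standing in for a timestamp set by a remote publisher) are hypothetical names that belong to neither ROS nor any existing middleware.

```python
import time
from collections import defaultdict


class Broker:
    """In-process topic broker that flags deliveries exceeding a deadline."""

    def __init__(self, clock=time.monotonic):
        self._clock = clock
        self._subscribers = defaultdict(list)  # topic -> [(callback, deadline_s)]
        self.missed = 0  # deliveries skipped because the deadline had passed

    def subscribe(self, topic, callback, deadline_s):
        self._subscribers[topic].append((callback, deadline_s))

    def publish(self, topic, payload, sent_at=None):
        # sent_at stands in for a timestamp attached by a remote publisher.
        sent_at = self._clock() if sent_at is None else sent_at
        for callback, deadline_s in self._subscribers[topic]:
            latency = self._clock() - sent_at
            if latency > deadline_s:
                self.missed += 1  # late: count it instead of delivering stale data
                continue
            callback(payload, latency)


received = []
broker = Broker()
broker.subscribe("odometry", lambda msg, latency: received.append(msg), deadline_s=0.5)
broker.publish("odometry", {"x": 1.0, "y": 2.0})  # fresh message, delivered
broker.publish("odometry", {"x": 9.9, "y": 9.9}, sent_at=time.monotonic() - 1.0)  # stale
print(received, broker.missed)
```

A real implementation would also need bounded-latency transport between nodes and a scheduler that honors priorities, which is where the existing real-time pub/sub middleware mentioned above would come in.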

 
