The source data can be found at orgs.json (335 KB). It has some extra fields like links, licenses, and categories, although they've been omitted here. I would like to add previous years, and I'd like to add the projects under each organization as well. But the previous years are more complicated to get: 2022-2016, 2015-2009, 2008, 2007, 2006, and 2005.
We develop Faust (Functional Audio Stream), a functional programming language for sound synthesis and audio processing with a strong focus on the design of synthesizers, musical instruments, audio effects, etc. Faust targets high-performance signal processing applications and audio plug-ins for a variety of platforms and standards.
The International Neuroinformatics Coordinating Facility (INCF; www.incf.org) is an international organization launched in 2005, following a proposal from the Global Science Forum of the OECD.
INCF was established to facilitate and promote the sharing of data and computing resources in the international neuroscience community. A larger objective of INCF is to help develop scalable, portable, and extensible applications that can be used by neuroscience laboratories worldwide.
The mission of INCF is to make neuroscience FAIR (Findable, Accessible, Interoperable and Reusable) by sharing and integrating neuroscience data and knowledge worldwide. We foster scientific community collaboration to develop standards for data sharing, analysis modeling and simulation in order to catalyze insights into brain function in health and disease.
INCF activities are open to all who can contribute to neuroinformatics at the international level. We have a global community of neuroscience researchers working on new and improved tools for all of neuroscience – enabling other researchers to make more and faster discoveries, and improving our understanding of the brain.
Haiku is a fast, efficient, easy to use and lean open source operating system inspired by the BeOS that specifically targets personal computing.
Haiku is not a Linux distribution, nor does it use the Linux kernel. Haiku is the spiritual successor to BeOS and it is derived from the NewOS kernel, which was authored by Travis Geiselbrecht (geist), who was formerly employed by Be Inc. — the developers of BeOS.
Linux-based distributions stack up software – the Linux kernel, the X Window System, and various DEs with disparate toolkits such as GTK+ and Qt – that do not necessarily share the same guidelines and/or goals. This lack of consistency and overall vision manifests itself in increased complexity, insufficient integration, and inefficient solutions, making the use of your computer more complicated than it should actually be.
Instead, Haiku has a single focus on personal computing and is driven by a unified vision for the whole OS. That, we believe, enables Haiku to provide a leaner, cleaner and more efficient system capable of providing a better user experience that is simple and uniform throughout.
We operates as an umbrella organization (The OpenCAx Association) with several CAx communities including:
- KiCAD is an electronics design automation (EDA) suite
- OpenSCAD is a solid 3D modeler with a rich syntax for programmable geometry.
- LibreCAD is a 2D modeling system specializing in blueprint-style drawings and draftings.
- FreeCAD provides parametric 3D modeling with engineering functionality like FEM and CAM.
- IfcOpenShell is a library for working with standard IFC building model data.
- BRL-CAD is a solid modeling suite with conversion and advanced solid ray tracing features.
We want to select at least one student for each, so feel free to ask us where to start.
The Open Chemistry project is a collection of open source, cross platform libraries and applications for the exploration, analysis and generation of chemical data. The organization is an umbrella of projects developed by long-time collaborators and innovators in open chemistry such as the Avogadro, Open Babel, RDKit and cclib projects. These projects have been downloaded over 1,000,000 times and cited in over 2,000 academic papers. Our goal is to improve the state of the art, and facilitate the open exchange of chemical data and ideas while utilizing the best technologies from quantum chemistry codes, molecular dynamics, informatics, analytics, and visualization.
Freifunk unites wireless communities like Ninux, qaul.net, Guifi.net, and Evernet e.G. Our communities extensively rely on OpenWRT Linux, OLSR, BATMAN, libremesh, or retroshare. Our communities build decentralized wireless network architectures based on embedded networking hardware such as WiFi routers. The contributions are made by individuals and local groups which are highly motivated to build open and free wireless networks providing Internet access to everybody. Moreover, we also create solutions that allow anonymous and secure communication.
Most wireless routers in our networks are based on affordable, off-the-shelf devices such as WiFi routers. The Freifunk operating system extends the OpenWrt Linux OS by additional software packages that enable multi-hop wireless mesh networking and new P2P communication features.
MuseScore's main purpose is the creation of high-quality engraved musical scores in a What-You-See-Is-What-You-Get environment. It supports unlimited staves, linked parts and part extraction, tablature, MIDI input and output, percussion notation, cross-staff beaming, automatic transposition, lyrics (multiple verses), fretboard diagrams, and in general everything commonly used in sheet music.
Style options to change the appearance and layout are available and style sheets can be saved and applied to other scores. In January 2020, MuseScore introduced a unique engraving and notation style built by its full time designer, Martin Keary and engraver Simon Smith.
MuseScore can play back scores through a sequencer and SoundFont sample library. Playback is currently being greatly expanded to include VST support. MIDI output to external devices and software synthesisers is also possible.
The team organise themselves according to a general roadmap published by the product owner every six months, which sets out provisional goals for the internal team. This allows the extended developer community to comment on the roadmap and potentially offer to take ownership over aspects of it. This also allows other contributors to have visibility on aspects of the codebase that may change over the next 6-12 months. In combination with our community organiser, tasks are drawn up and published to provide a well organised collaboration focused on a specific upcoming release.
All members provide bug tracking support and feedback.
GeomScale is a research and development project that delivers open source code for state-of-the-art algorithms at the intersection of data science, optimization, geometric, and statistical computing. The current focus of GeomScale is on scalable algorithms for sampling from high-dimensional distributions, integration, convex optimization, and their applications. One of our ambitions is to fill the gap between theory and practice by turning state-of-the-art theoretical tools in geometry and optimization to state-of-the-art implementations. Towards this goal, we will deliver various innovative solutions in a variety of application fields, like finance, computational biology, and statistics that will extend the limits of contemporary computational tools. GeomScale aims in serving as a building block for an international, interdisciplinary, and open community in high dimensional geometrical and statistical computing. The main development is currently performed in volesti, a generic open source C++ library, with R and python interfaces (the latter is hosted in package dingo), for high-dimensional sampling, volume approximation, and copula estimation for financial modelling. In particular, the current implementation scales up to hundred or thousand dimensions, depending on the problem. To our knowledge it is the most efficient software package for sampling and volume computation to date. It is, in several cases, orders of magnitude faster compared to packages that solve the same problems. It can be used to compute challenging multivariate integrals and to approximate optimal solutions in optimization problems. It has already found important applications in systems biology by analyzing large metabolic networks (e.g., the latest human network) and in FinTech by detecting shock events and by evaluating portfolios performance in stock markets with thousands of assets. Other application areas include AI and in particular approximate weighted model integration.
Mathematicians, scientists, researchers, and students need a powerful tool for their work or study. SageMath is a freely available open-source mathematical software system bundling the functionality of many software libraries, exposing their features in a common interface and extending on top of this with its own powerful algorithms. By leveraging the flexibility and universality of the underlying Python interpreter, SageMath is able to accommodate for a vast range of their requirements.
The mission of SageMath is to create a viable open-source alternative to all major proprietary mathematical software systems.
Python is the main programming language inside the SageMath library and also the language of choice for all interactions with the built-in objects and functions for expressing mathematical concepts and calculations. Besides a command-line and programming-library interface, its primary user interface is a dynamic self-hosted website. From the perspective of a user, the interface language is Python, but with a thin extension built directly on top of it.
Almost all areas of mathematics are represented in SageMath, at various levels of sophistication. This includes symbolic calculus, 2D and 3D graphics, polynomials, graph theory, group theory, abstract algebra, combinatorics, cryptography, elliptic curves and modular forms, numerical mathematics, linear algebra and matrix calculations (over various rings), support for parallel computing, and a powerful coercion framework to “mix” elements from different sets for calculations. SageMath’s features also expand into neighboring fields like Statistics and Physics.
CHIPS Alliance develops high-quality, open source hardware components and tooling for silicon devices and FPGAs. By creating an open and collaborative environment, shared infrastructure, processes, legal support and governance, CHIPS Alliance shares resources to lower the cost of development and increase confidence in high-quality open source building blocks it helps manage. Within CHIPS Alliance's workgroups covering areas such as Analog, FPGA, ASIC design tools, Chisel, Interconnects and Cores, companies and individuals can work together to develop open source IP, tools and standards. CHIPS Alliance is open to all organizations who are interested in collaborating on open source hardware or software tools to accelerate the creation of more efficient and innovative chip designs.
omegaUp is a non-profit organization (501c3) aimed to increase the number of talented Software Engineers in Latin America. Our open source platform omegaUp.com lets students immerse in a learning environment that fosters self paced learning of computer science skills with a democratic access to state-of-the-art learning tools.
Teachers and tutors can create new coding challenges or use existing ones to start online programming competitions with local, national or even international reach, with automated grading of student's coding solutions. The omegaUp.com platform also enables teachers to leverage Competitive Programming tools and concepts inside the classroom to improve their educational experience.
Creative Commons (CC) is a United States-based nonprofit 501(c)3 organization that provides open content copyright licenses, public domain tools, and resources on copyright and information literacy in the digital age. Our free, easy-to-use copyright licenses provide a simple, standardized way for all creators, authors, and producers of knowledge assets and cultural works to give the public permission to share and use their works on conditions of their choice. CC licenses work in tandem with copyright, allowing creators to easily and legally change copyright terms from the default of “all rights reserved” to “some rights reserved” to best suit their needs.
In addition to the ongoing development and improvement of legal and technical tools for sharing, CC runs programs to put these tools into use in the world, and to mainstream open access, broad sharing, and reuse of creative works and knowledge assets. CC’s programmatic work encompasses outreach, trainings, partnerships, and much more, targeted to achieve measurable impact in improving affordable access to and enhancing reuse of content by anyone, anywhere, towards a more informed, culturally enriched, and productive society. All of CC’s work is amplified by a diverse and global network of volunteer affiliate groups.
Inkscape is a free and open-source vector graphics editor used to create vector images, primarily in Scalable Vector Graphics (SVG) format
Pharo is a dynamic, purely object-oriented programming language (everything is an object) in the tradition of Smalltalk. It is also a powerful IDE, focused on simplicity and immediate feedback. Its entire syntax fits on a postcard, and coding can be done directly in the debugger. Pharo has cool tools that empower you and make you highly efficient.
Pharo's goal is to deliver a clean, innovative, free and open-source immersive environment. By providing a stable and small core system, excellent development tools, and maintained releases, Pharo is an attractive platform to build and deploy mission-critical applications.
Pharo fosters a healthy ecosystem developed from both private and commercial contributors who advance and maintain the core system and its external packages.
CCExtractor Development is the creator of the de-facto captions extraction tool - CCExtractor. It is the one tool that is free, portable, open source and community managed that can take a recording from a TV show and generate an external subtitle file for it.
If you regularly watch content with subtitles you download from fan sites - you should know that the source file is most likely generated by CCExtractor. If you are a student in a university that uses subtitles for natural language study, you should know that most likely we are involved somehow.
While we already support subtitles from North America, Europe, Australia and more, our world map is not yet complete. We are actively looking for students that want to help us fill the gaps. We also want to automate many of the processes that are currently done manually, such as achieving perfect sync, our media file management.
In addition to continuously evolve our core tool, we have a growing number of new projects: support, AI, rclone, cloud, flutter, peer-to-peer and a few more that are starting to take shape.
The best part is - students have flexibility of choosing projects from a wide range of topics & technologies and even propose their own. We provide exceptional resources for students. Read more about the perks on our website.
We’re very excited to take part in GSoC for the 9th time. Most of our past students are still involved and are active in the community, which simply goes on to show how friendly and approachable is the environment. If you want to be a part of such community and help achieve our goals, come join our Slack group or mailing list!
Global Alliance for Genomics and Health
The Global Alliance for Genomics and Health (GA4GH) was formed to help accelerate the potential of genomic medicine to advance human health. It brings together over 400 leading genome institutes and centers with IT industry leaders to create global standards and tools for the secure, privacy respecting and interoperable sharing of Genomic data.
The openSUSE Project is a worldwide effort that promotes the use of Linux, tools around it, and open source. The openSUSE community is made up of multiple contributing communities that collaborate as part of a global open-source network. The openSUSE community develops, builds and maintains many of the packages, tools and infrastructure for the distribution. The community works together in an open, transparent and friendly manner as part of the global Free and Open Source Software community. openSUSE creates one of the world's best Linux distributions, as well as a variety of tools, such as OBS, OpenQA, Kiwi, YaST, OSEM and Uyuni. Distributions include a rolling release (Tumbleweed), a stable annual release (Leap) and operating systems for edge, embedded, cloud and containers through MicroOS and ALP.
The project is controlled by its community and relies on the contributions of individuals, working as testers, writers, translators, usability experts, artists and ambassadors or developers. The project embraces a wide variety of technology, people with different levels of expertise, speaking different languages and having different cultural backgrounds.
The SCoRe Lab has conducted research covering various aspects of security, digital forensic, mobile applications, cloud, blockchain and software tools. The goal of our research is to generate computing solutions by identifying low-cost methodologies and strategies that lead to sustainability. At present, the SCoRe Lab is at a stage of its evolution in which it has been able to secure high donor confidence as evidenced by simultaneous foreign-funded projects. SCoRe Lab studies and produces sustainable computing solutions with respect to low-cost computing and communication foundations in the developing and emerging regions in the world. We have developed several affordable and sustainable ICT solutions especially focusing on the requirements in the developing region. These solutions are briefly described in the projects section.
Checkstyle is a development tool to help programmers write Java code that is easy to read and adheres to a coding standard. Our utility automates the process of checking Java code to spare humans of this boring (but important) task. This makes it ideal for projects that want to enforce a coding standard. Each and every check also forces our users who are not familiar with these standards to read them and makes them think about why these standards exist.
The Tor Project
We believe everyone should be able to explore the internet with privacy. We are the Tor Project, a 501(c)(3) US nonprofit. We advance human rights and defend your privacy online through free software and open networks.
Unikraft is a fast, secure and open-source Unikernel Development Kit.
By tailoring the operating system, libraries and configuration to the particular needs of your application, it vastly reduces virtual machine and container image sizes to a few KBs, provides blazing performance, and drastically cuts down your software stack’s attack surface.
Unikraft is developed in the spirit of open source: public discussions on Discord, open source licensing model using BSD-3-Clause, public development and management on GitHub.
It does so in the context of a vibrant community consisting of highly skilled software engineers, researchers, teachers, students and hobbyists. Periodic meetings, hackathons and a consistent presence in open source events are central to the well functioning of the community.
We use guidelines for development and maintenance to ensure the creation of high quality code.
Public releases are planned to happen once every two months. In general, we aim for a public release to happen at the last Monday of each odd month (January, March, May, etc.)
We welcome contributors and users on Discord and on GitHub. If you are looking for a fun technical project in the area of operating systems, virtualization, come aboard on Discord.
Wikimedia strives to bring about a world in which every single human being can freely share in the sum of all knowledge; through various projects, chapters, and the support structure of the non-profit Wikimedia Foundation. Wikimedia officially supports 13 projects, including Wikipedia, the seventh most popular site on the internet. Wikipedia receives over 20 billion global page views every month and is available in over 300 languages. The tech behind it ensures that our projects are fast, reliable, and open to all. We design and build the open-source technology that powers Wikimedia projects from hosting Wikipedia to creating edit-checking artificial intelligence (AI). Community volunteers and Foundation technologists collaborate on MediaWiki, which makes sharing free knowledge possible. Read more about Wikimedia on our homepage.
LibreHealth is the foundation of a worldwide ecosystem of open source Health IT innovation and is a place where people can come together to build tools that enhance the quality of healthcare around the world.
We currently have under our umbrella the following projects:
LibreHealth Toolkit (http://librehealth.io/projects/lh-toolkit/), a foundational base for - building Health IT tools
LibreHealth EHR (http://librehealth.io/projects/lh-ehr/), an electronic health record derived from best practices and technology from leading open source systems.
LibreHealth Radiology (https://librehealth.io/projects/lh-radiology), a specialized distribution of LibreHealth Toolkit customized for radiology healthcare professionals
Our GSoC student projects will address the real-life needs of our projects to help improve the delivery of health care around the world. We have a team of expert mentors with decades of experience to help you in your work. They will be continually adding project ideas to our forum (https://forums.librehealth.io), and we encourage you to suggest ideas too as you learn more about our projects.
Check out project ideas (this will change from year to year): https://forums.librehealth.io/ideas
MZmine and MS-DIAL
Mass spectrometry is an analytical technique that measures the mass and abundance of small molecules with very high precision. However, the datasets produced by mass spectrometry instruments are complex and multi-dimensional. MZmine (https://mzmine.org; implemented in Java and JavaFX) and MS-DIAL (http://prime.psc.riken.jp/compms/msdial/main.html; implemented in C#) are popular open-source graphical software tools for mass spectrometry data processing. Both contain a library of processing algorithms and powerful visualization modules.
Both MZmine and MS-DIAL are highly cited (>4500 cumulated citations in Pluskal et al., BMC Bioinformatics 2010 and Tsugawa et al., Nature Methods 2015) demonstrating the impact the software tools have in the biomedical research community.
MZmine 3 is based on JavaFX for its user interface and provides support for cutting-edge techniques such as ion mobility separation. We aim to provide user-friendly, flexible, and extendable modular software covering the entire mass spectrometry data analysis workflow.
MS-DIAL 5 follows the mind of MZmine, and the program is written by C# and WPF. As the unique functions, the program supports the chromatogram deconvolution for data independent acquisition. Furthermore, MS-DIAL 5 also supports the annotation function for various mass fragmentation techniques, mass spectrometry imaging, and proteomics, resulting in the environment for multiomics analysis.
MZmine successfully participated in Google Summer of Code in the years 2017, 2018, and 2019 under the OpenChemistry umbrella, and in 2020 and 2022 as its own organization. Now both MZmine and MS-DIAL projects team up for GSoC 2023.
The X.Org Foundation (or X.Org for short) is a foundation chartered to develop and execute effective strategies that provide worldwide stewardship and encouragement of the X Window System and related projects. Indeed, X.Org is much broader than just the X Window System. Under the umbrella of the X.Org Foundation can be found Linux's DRM subsystem (10% of the size of Linux), Mesa (open source 3D and video-decoding acceleration for AMD, Intel, NVIDIA,...), and Wayland. X.org's technologies underpin much of today's computing environment, and expertise in it is in high demand worldwide.
Today, as the result of more than 20 years of work by teams of leading open source developers, most of the graphical user interfaces for Unix and Linux systems rely on X.Org. On top of the X-Server-based systems, this includes Android- and ChromeOS-based devices, and Wayland-based systems (Sailfish OS, Gnome, ...). X.Org is responsible for the design of the X libraries which interface with application, the acceleration architectures used for graphics, and the graphics and input drivers. In particular, it has been at the center of the recent restructuring of the Linux graphics driver stack.
Come help us make the future more open!
XWiki is an open source software development platform based on the wiki principles, under the LGPL license. In addition to being a full-featured wiki, it is also a second generation wiki allowing effortless development of collaborative web applications. On top of this platform a plethora of applications are developed, targeted mainly on aiding enterprise-level needs.
XWiki has a vibrant community of developers and users, consisting of individual users as well as organizations around the world that are using XWiki for their own Communities or Intranets.
We propose projects that cover server-side Servlet programming and client-side rich application development, together with usability and performance improvements.
cBioPortal for Cancer Genomics
The cBioPortal for Cancer Genomics is a resource designed to provide broad community access to cancer genomic data. It provides a unique user-friendly and biology-centric computational user interface, with the goal of making genomic data more easily accessible to translational scientists, biologists, and clinicians. The interface was explicitly built and continues to evolve with careful usability studies involving multiple biological and clinical users, and an active and engaged user base.
The public instance of cBioPortal (https://cbioportal.org/) is now one of the most popular online resources for cancer genomics data and attracts more than 4,000 unique visitors (cancer researchers and clinicians) per day. The two papers documenting the cBioPortal Cerami et al. Cancer Discov. 2012 & Gao et al. Sci. Signal. 2013 have been cited more than 5,600 and 5,100 times, respectively, since their publication. There are more than 30 actively used cBioPortal instances in hospitals, universities, pharmaceutical companies, and other institutes all over the globe.
We are a group of software engineers, bioinformaticians, and cancer biologists building software solutions for precision medicine for cancer patients. Our overall goal is to build infrastructure to support clinical decisions for personalized cancer treatment by utilizing “big data” of cancer genomics and patient clinical profiles. Our multi-institutional team currently has more than 30 active members, primarily from Memorial Sloan Kettering Cancer Center, the Dana-Farber Cancer Institute, Princess Margaret Cancer Centre, Children's Hospital of Philadelphia, and The Hyve, a bioinformatics company from the Netherlands.
FFmpeg is the leading multimedia framework, able to decode, encode, transcode, mux, demux, stream, filter and play pretty much anything that humans and machines have created. It supports the most obscure ancient formats up to the cutting edge. No matter if they were designed by some standards committee, the community or a corporation. It is also highly portable: FFmpeg compiles, runs, and passes our testing infrastructure FATE across Linux, Mac OS X, Microsoft.
The UC Open Source Program Office (OSPO) supports open source work throughout the University of California system. Beginning in 2022, the UC OSPO took over the programmatic responsibilities of the UCSC Center for Research in Open Source Software (CROSS), including mentorship activities such as GSoC. Following in line with the earlier work of CROSS, the OSPO creates partnerships with stakeholders within and outside the UC system in order to help students learn from open source communities, support scientists in using open source to accelerate research efforts, and connect students and scientists with sponsors from industry, government, and foundations. The OSPO helps bridge the gap between academic research and successful open source projects by promoting innovative projects maintained by UC-affiliated scientists and researchers. We support the transfer of cutting-edge technology resulting from UC-originating research to industry via successful open source projects. Due to the multi-campus nature of our current efforts, the projects we support and promote cover a wide range of topics and technologies - including:
- Open Source Hardware
- Storage Systems and Devices
- System Architecture
- Autonomous Vehicle Systems
- Data Management
- Data Analysis
All of our mentors are UC-affiliated scientists and researchers who are actively involved in one or more of these open source projects.
MetaBrainz Foundation Inc
The MetaBrainz Foundation is a non-profit that believes in free, open access to data. It has been set up to build community maintained databases and make them available in the public domain or under Creative Commons licenses.
Our data is mostly gathered by volunteers and verified by peer review to ensure it is consistent and correct. All non-commercial use of this data is free, but commercial users are asked to support us in order to help fund the project. We encourage all data users to contribute to the data gathering process so that our data can be as comprehensive as possible.
With this data we are building a music social network and bias free music recommendations.
The Palisadoes Foundation
The Palisadoes Foundation’s open-source software projects support community groups in organizing their daily activities. A significant proportion of our software contributions come from university students studying software engineering. Participants often come from under-served communities or geographic areas and are sponsored through our various programs. This prepares them for the competitive realities of the working world.
Life started for us in 2016 when a group of expatriate Jamaicans wanted to assist development of new and existing information technologies for the island’s social good. We knew academic and student leadership wanted greater practical exposure to the latest technology. In response, our pilot Calico program for Jamaican universities was created with the goal of addressing the issues graduates would face when employed.
After five years of success, members of the Jamaican diaspora suggested we expand our service area to be global. They saw similar needs in their own under-served communities where the digital divide is very visible and thought our experience in the developing world would be useful.
Since 2019 we have focused on creating a mobile app to help community organizations like ourselves better manage their membership. We saw the need for a social media component to keep in contact with our volunteers. We also felt these organizations often faced challenges with the project management of events and keeping track of volunteer abilities, roles and responsibilities.
We want to make the app useful for any community-based organization such as clubs, small religious institutions, neighborhood groups, and volunteer associations. We also want to see how the open source software could be installed as a multi-tenant, multi-site cloud hosted service as many of these organizations often don’t have sufficient IT skills to do so independently. We hope that our contributors will help us achieve this goal year.
The ns-3 Network Simulator Project
Are you interested in contributing to a widely-used performance evaluation tool for computer networking research? ns-3 is a discrete-event, packet-level network simulator with an emphasis on networking research and education. Users of ns-3 can construct simulations of computer networks using models of traffic generators, protocols such as TCP/IP, and devices and channels such as Wi-Fi and LTE, and analyze or visualize the results. Simulation plays a vital role in the research and education process, because of the ability for simulations to obtain reproducible results (particularly for wireless protocol design), scale to large networks, and study systems that have not yet been implemented. A particular emphasis in ns-3 is a high degree of realism in the models (including frameworks for using real application and kernel code) and integration of the tool with virtual machine environments and testbeds. Very large scale simulations are possible; simulations of hundreds of millions of nodes have been published. ns-3 has been in development since 2005 and has been making regular releases since June 2008. The simulator is written in C++, with bindings for Python scripting, and uses the CMake build system. We use a GPLv2 licensing model and heavily use mailing lists and Zulip chat, but typically not other social media.
FOSSology is an open source license compliance software system and toolkit. As a toolkit you can run license, copyright and export control scans from the command line. As a system, a database and web UI are provided to give you a compliance workflow. License, copyright and export scanners are tools used in the workflow.
Cameras are complex devices that need heavy hardware image processing operations. Control of the processing is based on advanced algorithms that must run on a programmable processor. This has traditionally been implemented in a dedicated MCU in the camera, but in embedded devices, algorithms have been moved to the main CPU to save cost. Blurring the boundary between camera devices and Linux often left the user with no other option than a vendor-specific closed-source solution.
To address this problem the Linux media community is collaborating with the industry to develop a camera stack that is open-source-friendly while still protecting vendor core IP. libcamera was born out of that collaboration and offers modern camera support to Linux-based systems, including traditional Linux distributions, ChromeOS and Android.
RTEMS (Real-Time Executive for Multiprocessor Systems) is a free real-time operating system (RTOS) designed for deeply embedded systems such as automobile electronics, robotic controllers, and on-board satellite instruments.
RTEMS is free open source software that supports multi-processor systems for over a dozen CPU architectures and over 150 specific system boards. In addition, RTEMS is designed to support embedded applications with the most stringent real-time requirements while being compatible with open standards such as POSIX. RTEMS includes optional functional features such as TCP/IP and file systems while still offering minimum executable sizes under 20 KB in useful configurations.
The RTEMS Project is the collection of individuals, companies, universities, and research institutions that collectively maintain and enhance the RTEMS software base. As a community, we are proud to be popular in the space application software and experimental physics communities. RTEMS has been to Venus, circles Mars, is aboard Curiosity, is in the asteroid belt, is on its way to Jupiter, and is circling the sun. It is in use in many high energy physics research labs around the world. There are many RTEMS users who do not belong to the space or physics communities, but our small part in contributing to basic scientific knowledge makes us proud.
The Eclipse Foundation provides our global community of individuals & organizations with a mature, scalable, and business-friendly environment for OSS collaboration and innovation.
Eclipse is an open source community that's focused around key principles of transparency, openness, and vendor neutrality: the work that we do is done in a manner that can be observed by anybody with an interest; project teams welcome new ideas, and invites others to participate; and vendor neutrality ensures that no single vendor can dominate a project and that everybody plays by the same set of rules (a so-called level playing field).
Naturally, Eclipse projects are also all about the code. With over three hundred and sixty (https://projects.eclipse.org/) open source projects covering a diverse set of of technologies, there's something here for everybody.
Eclipse projects build technology in areas such as Internet of Things (https://projects.eclipse.org/technology-type/internet-things), Programming Languages and IDE (https://projects.eclipse.org/technology-type/language), and Runtimes (https://projects.eclipse.org/technology-type/runtime) like Jetty and EE4J (http://www.eclipse.org/ee4j) (currently known as Java EE).
For those students interested in research, we have an entire working group focused on Science (https://projects.eclipse.org/projects/science) where researches from some of the world's most prestigious labs do open source development to support their research areas.
FreeCAD is a general-purpose parametric 3D computer-aided design (CAD) modeler and a building information modeling (BIM) software application with finite element method (FEM) support. It is intended for mechanical engineering product design but also expands to a wider range of uses around engineering, such as architecture or electrical engineering. FreeCAD is free and open-source, under the LGPL-2.0-or-later license, and available for Linux, macOS, and Windows operating systems. Users can extend the functionality of the software using the Python programming language.
National Resource for Network Biology (NRNB)
The National Resource for Network Biology (NRNB, https://www.nrnb.org) organizes the development of free, open-source software technologies to enable network-based visualization, analysis, and biomedical discovery. Biomedical research is increasingly dependent on knowledge expressed in terms of networks, including gene, protein and drug interactions, cell-cell and viral-host communication, and vast social networks. Our technologies enable researchers to assemble and analyze these networks and to use them to better understand biological systems and, in particular, how they fail in disease. The NRNB mentoring organization includes projects such as Cytoscape (https://cytoscape.org/), WikiPathways (https://wikipathways.org/), SBML (https://sbml.org/), and cBioPortal (https://cbioportal.org/). This is a great opportunity to work at the intersection of biology and computing! For example, Cytoscape is downloaded over 18,000 times per month by researchers. We take mentoring seriously and are proud of our 95% success rate (https://www.nrnb.org/alumni.html#gsoc-tab) with former students and projects. But don’t take our word for it, read testimonials from prior NRNB students (https://www.nrnb.org/testimonials.html#student-tab) and (https://www.nrnb.org/testimonials.html#mentor-tab).
Synfig is a 2D open-source animation software. It is capable to produce vector artwork and also can work with bitmap images.
The main concept of Synfig is tweening - you can define object positions or shapes of vector objects at certain points of time and program will interpolate in-between frames automatically. You can also use bones to control your animation on higher level.
With Synfig you can easily create motion graphics and cut-out animations for product explanation videos, tutorial videos, and more.
The GNOME Foundation is a non-profit organization that believes in a world where everyone is empowered by technology they can trust. We do this by building a diverse and sustainable free software personal computing ecosystem.
FreeType is a freely available software library to render fonts.
It is written in C, designed to be small, efficient, highly customizable, and portable while capable of producing high-quality output (glyph images) of most vector and bitmap font formats.
Some products that use FreeType for rendering fonts on screen or on paper, either exclusively or partially:
• GNU/Linux and other free Unix operating system derivates like FreeBSD or NetBSD; • iOS, Apple's mobile operating system for iPhones and iPads; • Android, Google's operating system for smartphones and tablet computers; • ChromeOS, Google's operating system for laptop computers; • ReactOS, a free open source operating system based on the best design principles found in the Windows NT architecture; • Ghostscript, a PostScript interpreter used in many printers.
Counting the above products only, you get more than a billion devices that contain FreeType.
CGAL is a software library that offers a number of reliable geometric data structures and algorithms. CGAL components operate in 2D and 3D, and sometime in arbitrary dimensions. Examples of components include convex hulls, convex decomposition, Delaunay triangulations, Voronoi diagrams, polygonal surface mesh data-structures, mesh generation, Boolean operations, envelope computations, intersection detection, surface reconstruction, and subdivision surfaces.
CGAL is used in a variety of application domains such as CAD/CAM (computer aided design and modeling), GIS (geographic information systems), geophysics, image processing, molecular biology, robotics, motion planning, and graphics.
CGAL is written in C++ and rigorously adheres to the generic-programming paradigm.
CGAL became an Open Source project in 2003. Most of CGAL is under the GPL v3+ license, and some core parts are under the LGPL v3+. The semi-annual releases have currently about 10,000 downloads. CGAL is commercially supported by the spin-off company GeometryFactory.
Ultimate open source team chat and communications platform
Rocket.Chat is one of the largest active open source (permissive MIT source license) nodeJS communications platform communities on GitHub, connecting 1500+ global community contributors (across projects) from 30+ countries, with 31700+ GitHub stars, 6500 forks, 490+ total releases and 12,000+ issues since inception in 2015.
Rocket.Chat is now installed on over 500k servers and counts over 12m users worldwide. Rocket.Chat’s long-term vision is to replace email with a real-time federated communications platform globally.
Users can set up Rocket.Chat on cloud or by hosting their own servers on-premises. Thanks to its extension support via Rocket.Chat Apps (plugins), and extremely rich API/SDK support, startups and innovators have customized Rocket.Chat into new products and services. Chatbots extend the interactivity of Rocket.Chat and integrate external IT systems. Omnichannel extends reach to wherever user may be including WhatsApp, Instagram, Facebook Messenger and more.
Rocket.Chat has won multiple prizes such as a 2016 Bossie Award for Best Open Source Application and first prize in the 2017 edition of All Things Open’s Startup Competition.
Rocket.Chat supports over 59 local languages. Rocket.Chat's community interacts 24 x 7 at the community Rocket.Chat server https://open.rocket.chat since 2015.
Our mission is to promote software learning within the arts, artistic learning within technology-related fields, and to celebrate the diverse communities that make these fields vibrant, liberatory, and innovative. Our goal is to support people of all backgrounds in learning how to program and make creative work with code, especially those who might not otherwise have access to tools and resources. We also believe that some of the most radical futures and innovative technologies are being built by communities that have been pushed to the margins by dominant tech. We hope to support those who have been marginalized by technology in continued self-determination by providing time, space, and resources.
Community of technologists, designers, writers and advocates who work to ensure freedom for all people through our software.
Python Software Foundation
Python is a programming language that lets you work more quickly and integrate your systems more effectively.
The Python Software Foundation serves as an umbrella organization to a variety of Python-related projects, as well as sponsoring projects related to the development of the Python language.
You can view a full list of participating sub-orgs here: https://python-gsoc.org/ideas.html
Sub-orgs: - Borg Collective - backup tools - CVE Binary Tool - scanning for known security vulnerabilities - DIPY - 3d/4d+ imaging - Fury - scientific visualization tools - LPython - ahead of time compiler for python - MNE-Python - tools for human neurophysiological data - Mission Support System - atmospheric science tools for flight planning - PyData/Sparse - n-dimensional sparse arrays for pyData - PyElastica - simulation and modeling for slender structures
GNU is an operating system that is free software—that is, it respects users' freedom. The GNU operating system consists of GNU packages (programs specifically released by the GNU Project) as well as free software released by third parties. The development of GNU made it possible to use a computer without software that would trample your freedom.
The GNU Project is the organization of maintainers and developers, webmasters, translators, and other contributors, that develop the more than 400 programs conforming the GNU OS.
MariaDB Foundation is the non-profit organization behind MariaDB Server, the fastest growing open source databases. MariaDB Foundation's mission is to ensure the continuity of the MariaDB Server code as well as foster and facilitated collaboration within the MariaDB ecosystem. As part of GSoC, MariaDB Foundation seeks to bring more developers into the MariaDB Server (and related projects) code base.
The STE||AR Group is an international team of researchers who understand that a new approach to parallel computation is needed. Our work is crafted around the idea that we need to invent new ways to more efficiently use the resources that we have and use the knowledge that we gain to help guide the creation of the machines of tomorrow. This organization aims to support, coordinate, and distribute research done in this area by creating a marketplace of ideas where concepts are debated and solutions are transparently developed.
mlpack is an intuitive, fast, and flexible C++ machine learning library with bindings to other languages. It is meant to be a machine learning analog to LAPACK, and aims to implement a wide array of machine learning methods and functions as a swiss army knife for machine learning researchers. In addition to its powerful C++ interface, mlpack also provides command-line programs and Python bindings, as well as bindings to other languages.
The Rizin project is a fork of the famous Radare2 project that started in 2006. Since then the codebase has been rewritten multiple times, modularized and extended to support many new features. The Rizin project aims to provide stability, focus on the most important features, and provide a user friendly interface. Along with Cutter - a Qt-based GUI and the RzGhidra decompiler it makes the effective tool for everyday reversing tasks.
Rizin is composed of a hexadecimal editor at its core, with support for several architectures and binary formats. It features code analysis capabilities, scripting, data and code visualization through graphs and other means, a visual mode, easy UNIX integration, a binary diffing engine for code and data, a shellcode compiler, multi-platform debug with reverse debug capabilities and much, much more!
TARDIS RT Collaboration
TARDIS is a tool that creates synthetic observations (spectra) for exploding stars (supernovae).
A supernova marks the brilliant death throes of a star, during which it outshines its entire galaxy. Through their explosive stellar death, supernovae enrich the Universe with new elements necessary for the formation of planets and life as we know it. From the iron in your blood to the silicon in your laptop, supernovae are responsible for producing many important elements from the primordial hydrogen and helium left over from the Big Bang.
TARDIS provides a link between theory and observations: by creating synthetic spectra from theoretical assumptions and comparing these to observations, we can both interpret data and test models for why, when and how supernova explosions occur. We, the community around TARDIS, are interested in combining astronomy, computer science, statistics and modern software design to build a tool that is useful both in research and teaching alike (with supporting documentation that would, in theory, allow anyone to recreate the project from scratch). Please join us on https://gitter.im/tardis-sn/gsoc.
Cockpit is an interactive server admin interface. It is easy to use and very lightweight.
Cockpit interacts directly with the operating system from a real Linux session in a browser.
Cockpit makes Linux discoverable, allowing sysadmins to easily perform tasks such as starting containers, storage administration, network configuration, inspecting logs and so on.
Jumping between the terminal and the web tool is no problem. A service started via Cockpit can be stopped via the terminal. Likewise, if an error occurs in the terminal, it can be seen in the Cockpit journal interface.
You can also easily add other machines that have Cockpit installed and are accessible via SSH and jump between these hosts.
Read more about cockpit in https://cockpit-project.org.
OpenWISP is an open source network management system which runs on low cost hardware and can be used to manage networks: from public wifi, university wifi, to mesh networks and IoT.
52°North Spatial Information Research GmbH
52°North is an open source initiative in the field of geoinformatics. Core topics of our activities are for example sensor web, web-based geoprocessing and earth observation.
Postman is an API platform for building and using APIs. Postman simplifies each step of the API lifecycle and streamlines collaboration so you can create better APIs—faster.
Purr Data is a visual programming language with an emphasis on generating sound, video, 2D/3D graphics, and connecting through sensors, input devices, and MIDI as well as OSC devices. It has an HTML5 frontend. It also has realtime DSP backend written in C which can run natively or in the browser.
Purr Data has a special emphasis on generating audio and/or video in real time, with low latency. Much of its design focuses on receiving, manipulating, and delivering high-quality audio signals. Specifically, the software addresses the problem of how to do this efficiently and reliably on general purpose operating systems like macOS, Windows, Debian, etc.-- i.e., systems designed mainly for multi-tasking.
Purr Data can easily work over local and remote networks. It can be used to integrate wearable technology, motor systems, lighting rigs, and other equipment. Purr Data is also suitable for learning basic multimedia processing and visual programming methods, as well as for realizing complex systems for large-scale projects.
Apache Software Foundation
The Foundation provides an established framework for intellectual property and financial contributions that simultaneously limits contributors potential legal exposure. Through a collaborative and meritocratic development process, Apache projects deliver enterprise-grade, freely available software products that attract large communities of users. The pragmatic Apache License makes it easy for all users, commercial and individual, to deploy Apache products.
Jenkins is a popular open source automation server which is used for building, testing, CI/CD, deployment and many other use-cases. Our motto is Build great things at any scale.
Jenkins, originally founded in 2006 as Hudson, is one of the leading automation servers. Jenkins' motto is Build great things at any scale. Using an extensible, plugin-based architecture, developers have created hundreds of plugins to adapt Jenkins to a multitude of build, test, and deployment automation workloads. As Jenkins is open-source, MIT License is used for most of the components.
This year we invite potential GSoC contributors to join the Jenkins community and to work together to improve Jenkins. We have many strategic project ideas which are important to potentially hundreds of thousands of Jenkins users.
The Jenkins project is a part of Continuous Delivery Foundation (CDF).
Learning and practicing AI is not easy. Deploying AI in real applications is challenging and complex. We realized that and created OpenVINO – an open-source toolkit for bringing AI models to life on the most widespread and available platforms like x86 CPUs and integrated Intel GPUs. One can use OpenVINO to optimize own model for high performance with the most advanced model optimization capabilities and run those through OpenVINO runtime on various devices.
We are driven by the idea of making it easy for a regular person to learn and use AI with minimal investments and unlock innovation with AI to a broad audience. We allow AI inference on the most widespread and popular platforms and provide pre-trained models, educational and visual demos, image annotation tools, and model training templates to get you up to speed. With good use-case coverage and simple/educational API, OpenVINO is also a toolkit of choice for AI in production by plenty of companies. It is a fully production-ready software solution.
Qdrant is powering the next generation of AI applications with advanced and high-performant vector similarity search technology. Our main project is the Vector Search Engine and Database. It is open-source and written in Rust 🦀. With Qdrant, embeddings or neural network encoders can be turned into full-fledged applications for matching, searching, recommending, and much more!
MoveIt is the most widely used software for robotic manipulation and has been used on over 150 robots. It is released under the terms of the BSD license, and thus free for industrial, commercial, and research use.
By incorporating the latest advances in motion planning, manipulation, 3D perception, kinematics, control and navigation, MoveIt is state of the art software for mobile manipulation.
Learning Equality is an education technology nonprofit that develops and maintains Kolibri, an adaptable set of open solutions specially designed to support teaching and learning with technology but without the Internet for the half of the world that still lacks access to connectivity. Kolibri is centered around an offline-first learning platform that runs on a variety of low-cost and legacy devices. It is complemented by a curricular tool, a library of open educational materials, and a toolkit of resources to support training and implementation. These tools are open and available in a variety of languages, to better support learners and educators globally. As a community-driven nonprofit, Learning Equality works closely to co-design Kolibri with a core network of collaborators, including national NGOs, UN agencies, government, and corporate partners. We also adopt a needs-based approach, constantly gathering insights from our community to inform the development of our tools. Through its do-it-yourself adoption model and strategic collaborations, Kolibri has reached learners and educators in more than 220 countries and territories since its launch in 2017.
Mayor's Office of New Urban Mechanics
Formed in 2010, the Mayor's Office of New Urban Mechanics serves as the City of Boston's innovation incubator, building partnerships between internal agencies and outside entrepreneurs to pilot projects that address resident needs. We design and implement experiments and prototypes that address a range of urban issues. From community green spaces to the future of mobility to Accessory Dwelling Units, our approach to innovation is human-centered, nimble and responsive to the changing needs of our growing City.
Mantis is a platform to build an ecosystem of realtime stream processing applications. Mantis provides a robust, scalable platform ideally suited for high volume, low latency use cases like anomaly detection and alerting. Mantis has been in production at Netflix since 2014. It processes trillions of events and petabytes of data every day.
Enabling individuals to define the connected devices in their lives, BeagleBoard.org is the open-source, community-driven hardware precursor to Raspberry Pi, continuing to improve performance, access and openness for embedded development. The community is diverse with many professional developers utilizing the open source designs to build production solutions. Supported by a non-profit foundation, the community seeks to improve access to technology for making embedded devices using open source software and hardware. On-going developments include open designs around DSP/AI/ML accelerators, RISC-V cores, FPGA-based open hardware developement, software-defined radios, robotics/motor control, privacy-oriented personal servers, musical instruments, lighting displays, and open standards that simplify and clarify embedded systems technology.
The ENIGMA Team
ENIGMA is a game development environment comprising an IDE, Compiler, and Game Engine.
ENIGMA is meant to be simple to learn and even simpler to work with. ENIGMA makes it refreshingly easy to stand up simple games. Recreating Atari Pong is a job for a handful of minutes. This is true in many modern engines, but unlike most, ENIGMA scales well to positively enormous games, as the engine (and ultimately, all user code) is written in C++.
The compiler bundles a C++ parser which crawls the engine for definitions. This theoretically allows users to access library routines directly (e.g. OpenGL), though we recommend against this for portability reasons. This framework is another good surface for GSoC projects, though it's more advanced, so we've left it unexplored in our project ideas list.
To really understand the utility of the project, consider a typical game, which uses rooms (a type of ENIGMA asset) to lay out instances of objects (game entity classes) visually. The objects contain code for specific events (such as step, draw, or keyboard) which can call out to scripts or begin timelines or start movement on paths. ENIGMA's compiler translates this to C++ and builds it against the engine after parsing both to learn how to integrate them.
Climate change science has witnessed an explosion in the amount and types of data that can be brought to bear on the potential responses of the terrestrial carbon cycle and biodiversity to global change. Many of the most pressing questions about global change are not necessarily limited by the need to collect new data as much as by our ability to synthesize existing data. This project specifically seeks to improve this ability. Because no one measurement provides a complete picture, multiple data sources must be integrated in a sensible manner. Process-based models represent an ideal framework for integrating these data streams because they represent multiple processes at different spatial and temporal scales in ways that capture our current understanding of the causal connections across scales and among data types. Three components are required to bridge this gap between the available data and the required level of understanding: 1) a state-of-the-art ecosystem model, 2) a workflow management system to handle the numerous streams of data, and 3) a data assimilation statistical framework in order to synthesize the data with the model.
Internet Health Report
The Internet Health Report monitors the conditions of networks that compose the Internet. It aims to provide network operators, policymakers, and other stakeholders, with a better understanding of the Internet's infrastructure and its evolution.
The Linux Foundation
The Linux Foundation is a non-profit consortium dedicated to fostering the growth of Linux. Founded in 2007 as a merger of the former Free Standards Group (FSG) and the former Open Source Developer Lab (OSDL), the LF sponsors the work of Linux creator Linus Torvalds and is supported by leading Linux and open source companies and developers from around the world. The Linux Foundation promotes, protects and standardizes Linux by providing unified resources and services needed for open source to successfully compete with closed platforms. For more see our About page. All software produced by us is free software published under OSI-approved licenses. See project ideas page for the license used by each project.
The Honeynet Project
The Honeynet Project is a leading international 501c3 non-profit security research organization, dedicated to investigating the latest attacks and developing open source security tools to improve Internet security. With Chapters around the world, our volunteers have contributed to fight against malware (such as Conficker), discovering new attacks and creating security tools used by businesses and government agencies all over the world.
The Honeynet Project uses GSoC as a incubator for new R&D projects, and to recruit active new members.
When working with different technologies and developers of different programming languages, the productivity of the entire team worsens due to the lack of interoperability and communication between them. If the developers need two technologies which are written in different programming languages, for instance, a C/C++ library called from NodeJS, the team usually needs to port to one of the two languages or write a wrapper around them. Maintaining a port of a library or the plumbing code is frequently error-prone, time-consuming, and does not add any value to the final product.
The main objective of MetaCall is to provide a transparent interoperability in both ways, no matter what language you are using, so you feel like you are using a library written in the same language but in fact, it may be written in C, NodeJS or any other language.
MetaCall currently provides a mechanism to introspect the types and function signatures, which allows us to provide this type info to the caller language. You can have type safety and at the same time avoid boilerplate in both directions.
It addresses the main shortcomings of embedding independent languages separately. Having a common implementation with a plugin architecture allows you to implement newer languages without rewriting more code. With a single solution you get C#, Ruby, Python or any other language you prefer. We can improve the core continuously and add new languages.
Finally, we are using the polyglot runtime in cloud computing so we take advantage of the interoperability capabilities and allow to build complex polyglot distributed systems with ease. It is possible to build monolithic and mono-repo projects that can be distributed and scaled separately through our Function as a Service built on top of MetaCall, allowing the developer to maximize the productivity without the need of DevOps plumbing or thinking about intercommunication protocols and architectural details.
We build Wagtail, a popular content management system. It's built on Python, by an active and engaged open source community, which has grown rapidly since Wagtail's release in 2014. Wagtail is available in over 40 languages, and used by some of the world's best-known organizations, including NASA, Google, Mozilla, MIT, and the UK's National Health Service, as well as museums, universities, non-profits, governments, banks, studios, restaurants, startups and bloggers around the world.
The Mandiant FLARE team is a collection of about 40 reverse engineers that analyze malware in support of threat intel, incident response, and computer forensic investigations. We spend our days using disassemblers, debuggers, decompilers, and emulators to figure out what malware does and how we can contain it. We’re known for delivering training sessions that share our experience and releasing open source software that automates the boring things. If you have even a passing interest in reverse engineering or malware analysis, reach out so that we can chat!
International Catrobat Association
Computational thinking for all with free visual coding apps The Catrobat project develops useful frameworks to create games, animations, or apps easily within a short time. This set of mobile creativity tools for smartphones is inspired by the well-known Scratch framework by the Lifelong Kindergarten Group at the MIT Media Lab. The motivation behind the project is that programming is an important cultural technique on the same level as mathematics and physics, from a practical as well as from a philosophical point of view. Our aim thus is to popularize the skills needed to program from an early age in a fun and engaging way that will facilitate the spread of its adoption among young people all over the world.
Our awarded Android app “Pocket Code” is currently the most famous outcome of the project. Without the need for any further devices, users have the possibility to create their first program directly on their mobile device in just a few steps using visual Bricks. Pocket Code supports all common device sensors, provides special Bricks for different robotic devices (Lego Mindstorms, Robotix Phiro, etc.) as well as for hardware devices such as the Arduino board or the Raspberry Pi, and of course offers elements of programming languages such as variables, if-statements, concurrency, etc. We also work on Pocket Code for iOS and on a large number of extensions. That’s why developers of different fields help us to keep our products up-to-date to meet the current needs of our users.
Motivated by prizes (such as the Lovie Award, the Austrian National Innovation Award or the Re-Imagine Education Award) and being featured by different programs (like Google Play for Education or code.org), our team is working on many different subprojects and extensions. Over 500 developers already contributed to our project on different topics such as app development, web technologies, graphics, usability, internationalization, or design.
An open-source storage platform that implements storage on a single distributed computer cluster and provides a 3-in-1 interface for object-, block- and file-level storage.
Django Software Foundation
Django is a high-level Python Web framework originally developed at the Lawrence-Journal World. Django was designed to handle two challenges: the intensive deadlines of a newsroom and the stringent requirements of the experienced Web developers who wrote it. It lets you build high-performing, elegant Web applications quickly.
Machine Learning for Science (ML4SCI)
Machine Learning for Science (ML4SCI) is an umbrella organization for machine learning-related projects in science. ML4SCI brings together researchers from universities and scientific laboratories with motivated students to join existing scientific collaborations and contribute to cutting edge science projects across a wide variety of disciplines. Students work on existing problems to develop new machine learning-based approaches and produce open source code that directly contributes to solving these scientific challenges.
ML4SCI currently includes projects from a variety of fields. For example, some of them explore the uses of machine learning for particle reconstruction and classification in high-energy physics, deep learning-based searches for dark matter in astrophysics, applications of machine learning techniques to data returned from planetary science missions, applications of quantum machine learning to science, and others.
Machine learning ideas and approaches can be broadly applicable and transferable across the scientific domains. The goals of ML4SCI projects are to grow the open-source community in machine learning for science by addressing important scientific challenges and transferring the knowledge and tools of machine learning across the disciplines. We look forward to your applications!
The Julia Language
The Julia Language is an open-source, high level, and dynamic language built to be easy to use like Python while having speed near C++. As an umbrella organization, we house projects related to core Julia (the language) as well as the broader Julia ecosystem.
Open Genome Informatics
The Open Genome Informatics group represents an umbrella organization consisting of several open source and open access genomics and bioinformatics projects worldwide. Our goals are to develop and maintain a collection of sustainable software tools for managing, analyzing, visualizing, storing, and disseminating genomic data.
The VideoLAN project is led by and composed of a team of volunteers who believe in the power of open source to rock the multimedia world. We produce multimedia software notably the famous VLC media player as well as designated libraries and codecs.
Zulip is the only modern team chat app that is ideal for both live and asynchronous conversations. Zulip has a web app, a cross-platform mobile app for iOS and Android, cross-platform desktop and terminal apps, and over 100 native integrations. The entire Zulip codebase is 100% open source.
Zulip has been gaining in popularity since it was released as open source software in late 2015, with code contributions from over 1000 people from all around the world. Thousands of people use Zulip every day, and your work on Zulip will have meaningful impact on their experience.
As an organization, we value engaged, responsive mentorship and making sure our product quality is extremely high. You can expect to receive disciplined code reviews by highly experienced engineers. Since Zulip is a team chat product, your GSoC experience with the Zulip project will be highly interactive.
As part of our commitment to mentorship, Zulip has over 160,000 words of documentation for developers, much of it designed to explain not just how Zulip works, but why Zulip works the way that it does.
To learn more about the experience of doing GSoC with Zulip, check out our blog post: https://blog.zulip.com/2021/09/30/google-summer-of-code-2021/.
Scalable Parallel Computing Laboratory
Scalable Parallel Computing Laboratory (SPCL) perform research in all areas of scalable computing. The research areas include scalable high-performance networks and protocols, middleware, operating system and runtime systems, parallel programming languages, cloud computing and serverless, support, and constructs, storage, and scalable data access.
The JPF team
The Java Pathfinder (JPF) project was initially conceived and developed at NASA Ames Research Center in 1999. JPF was open sourced in April 2005 as one of the first ongoing NASA development projects to date, and it is now released under the Apache license, 2.0. JPF is an extensible Java virtual machine written in Java itself. It is used to create a variety of verification and debugging tools, ranging from software model checkers to test case generators using symbolic execution. JPF is a research platform and a production tool at the same time. Although JPF has major contributions from companies and government agencies, our main user community is academic - there are ongoing collaborations with more than 20 universities worldwide. The JPF team for GSoC 2023 includes researchers from NASA Ames Research Center, KTH Royal Institute of Technology - Sweden, York University - Canada, Brigham Young University, Carnegie Mellon University, University of Minnesota, Charles University - Czech Republic, and National University of Singapore.
JPF is designed to be highly extensible. There are well-defined extension mechanisms, directory structures and build procedures, which keep the core relatively stable and provide suitable, well separated testbeds for new ideas and alternative implementations. As a consequence, we host a number of such extension projects on our own, public JPF server, together with a Wiki that provides a central location for learning about, obtaining, and contributing to JPF.
JPF has been used for a variety of application domains and research topics such as verification of multi-threaded applications, graphical user interfaces, networking, and distributed applications. In addition to its continued presence in academia, JPF has matured enough to support verification of production code and frameworks such as Android. JPF is constantly being extended with support for verification of new types of correctness properties and for new types of application domains.
Red Hen Lab
Red Hen Lab is an international distributed cooperative of researchers in multimodal communication. We are senior professors at major research universities, senior developers in technology corporations. We also include junior professors, postdoctoral students, graduate students, undergraduate students, and even a few advanced high school students. Red Hen develops code in Natural Language Processing, ASR, audio parsing, gesture analysis, media analysis, computer vision, and multimodal analysis.
Red Hen's multimodal communication research involves locating, identifying, and characterizing auditory and visual elements in videos and pictures. We may provide annotated clips or images and present the challenge of developing the machine learning tools to find additional instances in a much larger dataset. Some examples are gestures, eye movements, and tone of voice. We favor projects that combine more than one modality, but have a clear communicative function—an example would be floor-holding techniques. Once a feature has been successfully identified in our full dataset of several hundred thousand hours of news videos, cognitive linguists, communication scholars, and political scientists can use this information to study higher-level phenomena in language, culture, and politics and develop a better understanding of the full spectrum of human communication. Our dataset is recorded in a large number of languages, giving Red Hen a global perspective.
For GSoC 2022, we invite proposals from students for components for a unified multimodal processing pipeline, whose aim is to extract information from text, audio, and video, and to develop integrative cross-modal feature detection tasks. Red Hen Lab is directed jointly by Francis Steen (UCLA) and Mark Turner (Case Western Reserve University).
LibreOffice is a modern Free & Open Source Office suite, one of the largest open source projects, and used by millions of users worldwide. LibreOffice is compatible with many file formats like Microsoft® Word, Excel, PowerPoint and Publisher. At its heart though, LibreOffice is built around an open standard, the OpenDocument Format, as its native file format. LibreOffice is developed by users who, just like you, believe in the principles of Free Software and in sharing their work with the world in non-restrictive ways. The development of LibreOffice is supported by The Document Foundation which provides the infrastructure for the project. We believe that users should have the freedom to run, copy, distribute, study, change and improve the software that we distribute. While we do offer no-cost downloads of the LibreOffice suite of programs, Free Software is first and foremost a matter of liberty, not price. We campaign for these freedoms because we believe that everyone deserves them. Though the members of our community hail from many different backgrounds, we all value personal choice and transparency, which translates practically into wider compatibility, more utility, and no end-user lock-in to a single product. We believe that Free Software can provide better-quality, higher-reliability, increased-security, and greater-flexibility than proprietary alternatives. LibreOffice is a large project (approx. 6MLOC), which makes it interestingly complex, but at the same time, provides a place for all sorts of contribution & skills. The community behind LibreOffice is the heart of the project, without which we would not have the resources to continue developing our software. The passion and drive that every individual brings to the community results in collaborative development that often exceeds our own expectations. With tons of different roles in the project, we invite everyone to join us in our work and help us to make LibreOffice known, prosper, and accessible to all.
GnuTLS is a secure communications library implementing the SSL, TLS and DTLS protocols and technologies around them. It provides a simple C language API to access the secure communications protocols.
InVesalius is an Open Source organization that works developing free software for reconstruction of computed tomography and magnetic resonance images. The software is mainly used for rapid prototyping, teaching, forensics, and in the medical field. It is possible to use it on the Microsoft Windows, GNU/Linux and Apple Mac OS X platforms.
InVesalius main software started began their development on 2001 by Centro de Tecnologia da Informação Renato Archer (CTI), in Brazil, later it was released under GNU license and more practitioners joins the organization to improve their development.
At that time, there was no medical image software in Portuguese that fulfilled the Brazilian hospitals and clinics needs. Therefore, InVesalius came as a proposal of development with the aim to be a medical software image analysis with null acquisition cost, capability of execution on low-cost personal computers and the capability of execution on different operating systems and act as a platform to encourage the use and development of medical images in Brazil.
The Chromium projects include Chromium and Chromium OS, the open-source projects behind the Google Chrome browser and Google Chrome OS, respectively.
Chromium is an open-source browser project that aims to build a safer, faster, and more stable way for all users to experience the web.
Chromium OS is an open-source project that aims to provide a fast, simple, and more secure computing experience for people who spend most of their time on the web.
GitLab is The DevOps Platform that empowers organizations to maximize the overall return on software development by delivering software faster and efficiently, while strengthening security and compliance. With GitLab, every team in your organization can collaboratively plan, build, secure, and deploy software to drive business outcomes faster with complete transparency, consistency and traceability.
Kiwix provides copies of websites that can be browsed offline. We run scrapers that will crawl a given website and compress it into a single .zim archive (based on the openZIM format).
The zim files can then be stored locally and read on the fly by Kiwix in such a way that the user experience is similar to being online.
We can fit the entirety of Wikipedia on a regular Android phone, but there are more than 7,000 zim files available in 100+ languages, mostly focused on educational content (e.g. Wikipedia, StackOverflow, Khan Academy, etc.).
Kiwix runs on all platforms (Linux, Windows, Android, etc.) and has around 8 million users worldwide, in pretty much any place you can think of that has limited or no connectivity: prisons, rural schools, refugee camps, even Antarctic bases!
Our big challenge is to make it as easy as possible to access or share offline content.
Plone is a Python-based CMS built with workflow and security at the forefront. It is developed by a world-wide community and uses modern technologies and techniques. It comes with a complete API and a React-based frontend. Also home to Guillotina.io, a full stack data framework, and the Zope and ZODB object database.
Society for Arts and Technology (SAT)
The Society for Arts and Technology [SAT] is a non-profit organization recognized internationally for the development and creative use of immersive technologies, virtual reality and high-speed networks. Founded in 1996, its triple mission as a center for the arts, training and research, is to enable and host teleimmersive multisensorial experiences.
SAT creates open-source tools for: Immersive rendering (Mirador, Puara, Poire), Interaction (LivePose), Projection mapping (Splash Calimiro), Telepresence (Satellite hub, Scenic, Switcher), Audio spatialization (Audiodice, SATIE, vaRays), Haptic Feedback (Haptic Floor, Telluriq), Data sharing (shmdata).
SAT contributes to opens-source tools for: Glue and inter-operability (libmapper), Immersive multisensory multimedia authoring (blender, OSSIA score, webmapper).
We are an independent not-for-profit center established at EPFL. Scala Center's Mission is to guide and support the Scala community, coordinate and develop open-source libraries and tools for the benefit of all Scala users, provide deep, and quality, educational materials for Scala.
Sugar is an activity-focused, free/libre open-source software learning platform for children. Collaboration, reflection, and discovery are integrated directly into the user interface. Through Sugar's clarity of design, children and teachers have the opportunity to use computers on their own terms. Students can reshape, reinvent, and reapply both software and content into powerful learning activities. Sugar's focus on sharing, criticism, and exploration is grounded in the culture of free software (FLOSS)
Open Technologies Alliance - GFOSS
Open Technologies Alliance (GFOSS) is a non-profit organization, with 37 Universities and Research Centers as its shareholders. Our main goal is to promote Openness. GFOSS – Open Technologies Alliance is a platform for Open Standards, Free Software, Open Content, Open Data & Open Hardware in Greece. The major Greek Universities and Research Centers participate in GFOSS – Open Technologies Alliance, while leading members of the Greek community of developers play a key role in the implementation of our policies. Through our initiatives we aspire to contribute and coordinate the efforts of groups of volunteers, public servants, university researchers and students enabling them to form the backbone of Greek FOSS development and implementation. GFOSS is one of the strategic actors for the promotion of OSS throughout Greece (see https://joinup.ec.europa.eu/sites/default/files/inline-files/OSS%20Country%20Intelligence%20Report_GR.pdf ). Many public administrations and academic institutions collaborate with GFOSS to implement open source projects and through Google Summer of Code we give the opportunity to students to actively engage in the production and the actual implementation of an open source project. GFOSS also contributes and advises on the development of various open source projects related to e-government and digital transformation in Greece (e.g. https://howto.gov.gr/ - https://forma.gov.gr/) and actively promotes the use of Open Source software and hardware in the Greek primary and secondary education through the Open Educational Technologies Competition (https://openedtech.ellak.gr/ )
We are an Australian not-for-profit umbrella organization for open-source projects. We believe the open-source philosophy provides a resource-efficient channel to transfer knowledge and achieve innovation and education.
In 2023, we offer the following projects:
We have a diverse group of mentors, including GSoC students from previous years who decided to become long-term contributors as well as academics with extensive experience in supervising undergraduate, M.Sc. and PhD students on theses and projects, whose results are often published and presented in the most prestigious conferences of our research fields.
coreboot provides a fast, reliable, secure, and predictable boot-firmware solution for numerous modern and legacy chipsets. There are millions of devices running coreboot, including Google Chromebooks and Chromeboxes, and it runs in many datacenters. In 2021, coreboot.org had over 300 active developers, and over 6000 commits. The coreboot project believes in the principles of Open Source software. It borrows many well known concepts from other Open Source projects: Kconfig, Linux kernel coding style, git repository, and Gerrit for code reviews. coreboot has numerous advantages over other firmware projects. It comes with SoC support available to all developers. The codebase is shared with developers and companies which are helping to improve the SoC and driver-specific implementations.
** Extremely fast ** - coreboot is designed to do the minimum amount necessary and get out of the way. - Desktop and Notebooks boot up in 400ms - 2.5s. - Server boot time can be as low as a quarter of the OEM BIOS boot time.
** Flexible and customizable ** - Because coreboot only does hardware init, then jumps to a payload to continue, it is very flexible. - GPLv2 licensed core. Payloads can be proprietary and we offer a BSD licensed support library. coreboot provides ready-to-build payloads like UEFI, SeaBIOS, U-Boot, Grub2 and many more. - Recovery Mode based on multiple copies of the firmware which can be updated independently. - Integrity of stages and binaries in coreboot can be easily verified. - Designed for security by default. - Uses a minimal trusted computing base for each platform which is easily auditable, helping to guarantee security. As coreboot is Open Source, anyone can check the codebase.
** Debugging ** - There are multiple ways to extract the boot log, from classic serial to EHCI debug, even over the pc-speaker. - GDB Stub support via serial. - In memory console log support. - Tracing functions and other exotic features are supported.
Blender is a free and open source 3D creation suite, providing individuals and small teams a complete pipeline for 3D graphics, modeling, animation and games.
Blender is being made by 100s of active volunteers from all around the world; by studios and individual artists, professionals and hobbyists, scientists and students, vfx experts and animators, and so on. All of them are united by an interest to have access to a fully free/open source 3D creation pipeline. Blender Foundation supports and facilitates these goals - even employs a staff for that - but fully depends on the online community to achieve it.
Free and Open Source Silicon Foundation
The FOSSi (Free and Open Source Silicon) Foundation is a not-for-profit organization with the support the growing community of open source silicon hardware. We do this with a variety of activities and through Google Summer of Code we bring together enthusiastic mentees and outstanding projects. Under our umbrella are open source silicon hardware projects, operating systems and compilers for such projects, tools for electronic design automation and the related ecosystem.
SPDX is an open standard for communicating software bill of material information, including provenance, license, security, and other related information. SPDX reduces redundant work by providing common formats for organizations and communities to share important data, thereby streamlining and improving compliance, security, and dependability. The SPDX specification is recognized as the international open standard for security, license compliance, and other software supply chain artifacts as ISO/IEC 5962:2021.
caMicroscope is a digital pathology data management, visualization and analysis platform. It consists of a set of web services to manage digital pathology images, associated clinical and imaging metadata, and human/machine generated annotations and markups. The image visualization client consists of HTML5 based web clients that support interactive rendering of the digitized image, and the creation and display of annotations and markups. In addition to the core caMicroscope digital pathology management framework, caMicroscope organization also consists of several other open source tools, in the domains of healthcare and big data. caMicroscope is used by several medical research organizations.
OpenStreetMap is a crowdsourcing project that creates and distributes free geographic data for the world. Our data is collected by hundreds of thousands of contributors around the globe and released with an open-content license. We allow free access not only to our map products, but all the underlying map data, which powers websites and apps used by billions of people worldwide.
OSM data can be freely used in both open and closed source software, and has attracted many commercial users. Still, the success of OSM wouldn't be possible without open source software and volunteer developers. The database, website and api running on our own servers, the editing tools used by contributors to improve the map, and many of the most popular libraries and end-user applications within the OSM software ecosystem are all open source software, and developed through a community-driven process.
As our Google Summer of Code participation spans this diverse set of software projects, most of which are maintained as independent efforts under the OSM umbrella, contributors will encounter a wide range of programming languages, paradigms and use cases. We hope that we have interesting challenges to offer for any developer, no matter their background!
Git is the most widely-used revision control system in Open Source. It is a distributed system with an emphasis on speed, data integrity, and support for distributed, non-linear workflows.
Many large and successful projects use Git, including the Linux Kernel, Perl, Eclipse, Gnome, KDE, Qt, Ruby on Rails, Android, PostgreSQL, Debian, and X.org.
This organization covers projects for Git itself. Other git-based software or services are not covered by this organization.
Stratosphere Laboratory, Czech Technical University in Prague
Stratosphere is a cybersecurity research laboratory at the AI Center FEE CTU in Prague, working at the intersection of cybersecurity, machine learning, and helping others.
Mathesar is a straightforward open source tool that provides a spreadsheet-like interface to a PostgreSQL database. Our web-based interface helps you and your collaborators work with data more independently and comfortably – no technical skills needed.
You can use Mathesar to build data models, enter data, and even build reports. You host your own Mathesar installation, which gives you ownership, privacy, and control of your data.
Genome Assembly and Annotation
The Genome Assembly and Annotation section of EMBL-EBI brings together key reference resources in the field of genomics:
Ensembl (http://www.ensembl.org) was created in 1999 in preparation for the publication of the first draft of the human genome, to allow researchers and clinicians to start translating the secrets hidden within the human genome into real world applications. Ensembl has grown into a champion of biodiversity, providing data for tens of thousands of species across our vertebrate, metazoa, plant, fungi and bacterial divisions.
MGnify (http://www.ebi.ac.uk/metagenomics) provides a free to use platform for the assembly, analysis and archiving of microbiome data derived from sequencing microbial populations that are present in particular environments. Over the past 2 years, MGnify has more than doubled the number of publicly available analysed datasets held within the resource.
WormBase (https://wormbase.org/) is one of the World's oldest active bioinformatic resources, more than 20 years old. We scan all published literature and datasets on the model organism C. elegans, to create a very comprehensive resouce of genomics, strains, experiments, paper and people, aimed towards accelerating research and discoveries in fundamental biology as well as human health.
The Hugo Gene Nomenclature Committee (HGNC) and its sibling project the Vertebrate Gene Nomenclature Committee (VGNC) are jointly responsible for defining the official names of genes in human and key vertebrate species. This official nomenclature ensures that studies and results on the same gene can easily be aggregated.
Given the rapid pace of generation of genomics and sequencing data, we support a fast-evolving software stack, and are constantly investigating new solutions for data storage, processing, distribution and display.
Please visit our projects page for ideas on potential GSoC projects: https://www.ensembl.info/about/projects/
The QEMU Project includes the QEMU open source machine emulator and virtualizer and also acts as an umbrella organization for the KVM Linux kernel module and rust-vmm community.
When used as a machine emulator, QEMU can run operating systems and programs made for one machine (e.g. an ARM board) on a different machine (e.g. your own PC). By using dynamic translation, it achieves very good performance.
When used as a virtualizer, QEMU achieves near native performances by executing the guest code directly on the host CPU. QEMU supports virtualization when executing under the Xen hypervisor or using the KVM kernel module in Linux. When using KVM, QEMU can virtualize x86, ARM, server and embedded PowerPC, and S390 guests.
Cloud Native Computing Foundation (CNCF) serves as the vendor-neutral home for many of the fastest-growing open source projects, including Kubernetes, Prometheus, and Envoy.
Pitivi is a truly unique video editor.
It champions design and usability research: there is no eternal newbie. Pitivi's user interface is carefully designed to suit both the newcomer and the professional, to be efficient and intuitive.
Pitivi's backend is the GStreamer Editing Services (GES) library. Built atop the giant shoulders of the industry-standard GStreamer multimedia framework, GES reduces fragmentation and risk while allowing diversity for application writers and adaptability for all kinds of purposes.
Through GStreamer we have any multimedia building-block we might want in Pitivi.
🚀 With GPT-4, multimodal AI is coming! Jina AI empowers businesses and developers to create top-notch multimodal AI services.
🌐 Join our inclusive community for GSoC tasks and work with cutting-edge techs like LMOps and MLOps. 📎 https://jina.ai/community/
👭 We will host a GSoC x Jina AI webinar on March 23rd, 14:00-15:00 PM (CET). Join us as our experienced mentors provide an in-depth overview of their GSoC projects and answer any questions you have about the project requirements and expectations.
The Internet Archive is a non-profit digital library.
We are the home of the Wayback Machine.
We are dedicated to provide the most effective fuzzing frameworks. Our work includes AFL++, the most effective and flexible fuzzer, and libafl, a library to build your own fuzzer with the most modern techniques and technologies.
Carbon is a successor language approach designed around interoperability with C++ as well as large-scale adoption and migration for existing C++ codebases and developers.
GNU Compiler Collection (GCC)
The GNU Compiler Collection (GCC) is an optimizing compiler produced by the GNU Project supporting various programming languages, hardware architectures and operating systems. It includes front-ends for C, C++, D, Objective-C, Fortran, Ada, and Go, as well as libraries for these languages (such as libgcc and libstdc++). Modula-2, Cobol and Rust front-ends are under development too. GCC includes support for OpenMP, OpenACC and Static Analysis.
Support the development, distribution, and adoption of open source software for use in robotics research, education, and product development.
Apertium is a free/open-source machine translation platform, and the organisation focuses on primarily symbolic language technology for less-resourced languages.
The Metasploit Framework is both a penetration testing system and a development platform for creating security tools and exploits. The framework is used by network security professionals to perform penetration tests, system administrators to verify patch installations, product vendors to perform regression testing, and security researchers world-wide. The framework is written in the Ruby programming language and includes components written in C, many flavors of Assembly, Python, Powershell, PHP, and other languages.
The framework consists of tools, libraries, modules, and user interfaces. The basic function of the framework is a module launcher, allowing the user to configure an exploit module and launch it at a target system. If the exploit succeeds, the payload is executed on the target and the user is provided with a shell to interact with the payload. Hundreds of exploits and dozens of payload options are available.
Liquid Galaxy project
Liquid Galaxy is a remarkable panoramic system that is tremendously compelling. It started off as a Google 20% project to run Google Earth across a small cluster of PC's and it has grown from there! Open source applications such as the MPlayer video player have been extended to run on Liquid Galaxy. Liquid Galaxy hardware consists of one or more computers driving multiple displays. Liquid Galaxy applications have been developed using a master/slave architecture. The view orientation of each slave display is configured in reference to the view of the master display. Navigation on the system is done from the master instance and the location on the master is broadcast to the slaves over UDP. The slave instances, knowing their own locations in reference to the master, then change their views accordingly. The Liquid Galaxy Project, while making use of Google Earth software, does not develop the Google Earth code-base itself. Google Earth is not open source software, although it is free (as in beer). Instead, the Liquid Galaxy Project works on extending the Liquid Galaxy system with open source software both improving its administration and enabling open source applications, so that content of various types can be displayed in the immersive panoramic environment.
gprMax is open source software that simulates electromagnetic wave propagation. It uses Yee's algorithm to solve Maxwell’s equations in 3D using the Finite-Difference Time-Domain (FDTD) method.
It is designed for simulating Ground Penetrating Radar (GPR) and is used to model electromagnetic wave propagation in fields such as engineering, geophysics, archaeology, and medicine. There are a wide range of applications from assessing infrastructure such as bridges and roads, locating buried utilities, mapping glaciers, finding anti-personnel landmines, to detecting tumours in the human body, and exploring the sub-surface of Mars and the Moon.
gprMax is command-line-driven software written in Python with performance-critical parts written in Cython. It does not currently feature a graphical user interface (GUI) which allows it to be very flexible and scriptable software that can run in high-performance computing (HPC) environments, i.e. on supercomputers.
gprMax can be run on either CPU or GPU. The CPU solver has been parallelised using OpenMP which enables it to run on multi-core CPUs. The GPU solver has been developed using the NVIDIA CUDA programming model. gprMax also features a Messaging Passing Interface (MPI) task farm, which can operate with CPU nodes or multiple GPUs.
The Dart language gives you a fast developer experience and works on any platform. Dart powers hot reload enabling you to make a code change and instantly see results in your running app, and compiles to ARM and x64 machine code enabling quick app startup times for mobile, desktop and the web.
Dart powers Flutter, Google’s UI toolkit for building beautiful, natively compiled applications for mobile, web, and desktop from a single codebase.
Department of Biomedical Informatics, Emory University
Biomedical Informatics is a multidisciplinary field that is motivated by our desire to improve diagnosis, clinical care, and human health, through novel computational approaches to use (and learn from) biomedical and clinical data. We use our expertise in computer science and informatics by developing various enabling tools, technologies, and algorithms to solve specific biomedical and clinical applications. And in doing so help advance our understanding of disease and treatment, and also develop useful software and applications. Members of the department work in a variety of areas that range from machine learning, healthcare middleware that leverages cloud computing, clinical information systems, clinically oriented image analysis, and biomedical knowledge modeling. The driving applications for the various ongoing projects include cancer research, organ transplant, HIV, medical imaging, radiation therapy, and clinical data analytics. All development work that is undertaken is free and open-source. We have had a diverse set of successful GSoC projects in the past. In previous years, GSoC contributors have worked on diverse projects such as: geospatial systems for exploring microscopy environments that leveraged Hadoop; GPU accelerated pipelines for computational analysis of digitized biopsies; interactive visualization platforms for viewing massive images (>1GB); systems for data agnostic sharing of biomedical research datasets; Apache Drill based data integration and de-duplication platform for SQL and NoSQL databases; CNN based high throughput analysis of digitized biopsies; A GUI for Tensorflow; integrated architectures for biomedical data integration and federation; and information visualization of heterogeneous medical data. Many of these projects have been published in reputable journals and presented at major conferences. Some of the projects proved to be so successful that they were adopted in major national/international biomedical research initiatives.
OpenAstronomy is a collaboration between open source astronomy and astrophysics projects that are used by researchers and engineers around the world to study our universe either by analysing the data obtained from amazing instruments like the James Webb Space telescope, the Square Kilometer Array or the Solar Dynamic Observatory, developing very sophisticated numerical models (eg. FLASH) or designing interplanetary trajectories for human-made spacecraft. The analysis of such data helps multiple types of research, from being able to forecast solar storms to detect planets in other stars, to understanding how galaxies are formed to explain the expansion and the origin of the universe.
OpenAstronomy is currently formed by 16 organisations that develop tools for different aspects of astronomy. The range of topics covered by these projects is wide, for example: - Astropy is a general Python library for astronomy, providing common tools such as celestial coordinates, image processing, tabular data reading and writing, units and support for astronomy-specific file formats; - SunPy provides utilities for obtaining and representing solar physics data, with clients for some of the largest online solar physics data archives and solar specific analysis and visualisation code; - Glue is a data visualization application and library to explore relationships within and among related datasets. - Julia Astro is a set of packages for general astronomy and astrophysics analysis using Julia; - And more!
As a single organisation, we aim to strengthen collaborations between the different sub-organisations, and at the same time increase the awareness among our users on the capabilities of our sister projects.
The Ruby organization collects mentors and students working on the Ruby language (MRI), the Ruby packaging system (Bundler, RubyGems, and RubyGems.org), and other Ruby projects. Any Ruby OSS project is eligible to be included in the Ruby GSOC organization.
As the world’s largest non-profit organization concerned with software security, OWASP:
- Supports the building of impactful projects;
- Develops & nurtures communities through events and chapter meetings worldwide;
- Provides educational publications & resources
in order to enable developers to write better software, and security professionals to make the world's software more secure.
At Ivy, we are on a mission to unify all ML Frameworks.
Ivy makes it possible to add any function, library or model from any framework directly into any other framework, with one line of code.
This opens a new era of ML, where all frameworks are totally interoperable, once and for all.
We currently support for PyTorch, TensorFlow, Numpy and JAX.
SQLancer is an automated testing tool that finds logic bugs in database systems (rather than in the databases themselves). Logic bugs are bugs that cause a database system to compute an incorrect result for a given query and a database on which the query is executed. To that end, SQLancer semi-randomly generates databases and queries, whose results are then automatically validated. SQLancer has found hundreds of bugs in database systems such as SQLite, MySQL, DuckDB, CockroachDB, and TiDB.
The Drupal Association is the non-profit organization focused on accelerating Drupal, fostering the growth of the Drupal community, and supporting the project’s vision to create a safe, secure, and open web for everyone.
AboutCode.org is a community of open source developers who are trying to make open source easier to use by providing open source tools to discover, identify and track open source components (aka. Software Composition Analysis – SCA). This includes tools, data and standards for code origin, FOSS licenses and security vulnerabilities.
CloudCV is a young open source cloud platform started in 2013 by students and faculty from Machine Learning and Perception Lab at Virginia Tech (now at Georgia Tech) with the aim to make AI research more reproducible. At CloudCV, we are building tools that enable researchers to build, compare and share state-of-the-art algorithms. We believe that one shouldn't have to be an AI expert to have access to cutting edge vision algorithms. Likewise, researchers shouldn't have to worry about building a service around their deep learning models to showcase and share it with others.
Kotlin is a modern programming language that makes developers happier.
JetBrains and Google are the current and founding members of the Kotlin Foundation. They work together to promote and advance the Kotlin language for many platforms, including Android, servers, iOS, and other targets. Today, 100+ people work on the core Kotlin project team at JetBrains & Google. There are an additional 350+ independent contributors to the core language and thousands contributing to the broader Kotlin ecosystem.
The Kotlin Foundation's responsibilities include: - Preservation the Kotlin trademarks - Appoint the Lead Language Designer - Control incompatible changes to the language
An open-source simple e2e test generation toolkit for developers. Keploy creates test cases, data mocks, stubs from API calls, DB queries, etc.
MDAnalysis is a Python library for the analysis of computer simulations of many-body systems at the molecular scale, spanning use cases from interactions of drugs with proteins to novel materials. It is written by scientists for scientists and is used for cutting edge research in biophysics, chemistry, soft-matter physics, and materials research around the world in academia and national research labs. MDAnalysis strives to be highly interoperable and hence a growing number of projects use MDAnalysis as their foundational library or integrate it.
The goal of MDAnalysis is to make it easy for users to analyze data that are produced by simulations (primarily molecular dynamics simulations) that run on some of the largest supercomputers in the world. MDAnalysis accomplishes this goal by providing a toolkit of programming building blocks that provide an abstract Python interface to the simulation data — agnostic of the specific simulation package that produced it — that lends itself to interactive data exploration and rapid prototyping but is also a robust foundational library that can form the basis for new computational tools.
MDAnalysis allows one to read particle-based trajectories such as the ones produced by MD simulations or individual coordinate frames (such as biomolecules in the Protein Databank format) and access the atomic coordinates through NumPy arrays. Together with a powerful selection language and many implemented analysis algorithms, MDAnalysis provides a flexible and fast framework for complex analysis tasks. Welcoming documentation such as the User Guide https://userguide.mdanalysis.org/ make it easy to get started. New releases are downloaded a few thousand times and the academic papers describing MDAnalysis are cited more than almost two thousand times, indicating the widespread use in the academic community.
Fortran-lang is an open-source community that develops tools and libraries for modern Fortran development. Our flagship projects include the standard library, Fortran build system and package manager, as well as the interactive compiler, LFortran. Fortran-lang also provides an inclusive and welcoming space for all Fortran enthusiasts around the world to collaborate.
Every user has work they need to do. The goal of Gentoo is to design tools and systems that allow a user to do that work as pleasantly and efficiently as possible, as they see fit. Our tools should be a joy to use, and should help the user to appreciate the richness of the Linux and free software community, and the flexibility of free software. This is only possible when the tool is designed to reflect and transmit the will of the user, and leave the possibilities open as to the final form of the raw materials (the source code.) If the tool forces the user to do things a particular way, then the tool is working against, rather than for, the user. We have all experienced situations where tools seem to be imposing their respective wills on us. This is backwards, and contrary to the Gentoo philosophy.
Put another way, the Gentoo philosophy is to create better tools. When a tool is doing its job perfectly, you might not even be very aware of its presence, because it does not interfere and make its presence known, nor does it force you to interact with it when you don’t want it to. The tool serves the user rather than the user serving the tool.
The goal of Gentoo is to strive to create near-ideal tools. Tools that can accommodate the needs of many different users all with divergent goals. Don’t you love it when you find a tool that does exactly what you want to do? Doesn’t it feel great? Our mission is to give that sensation to as many people as possible.
KubeVirt is a virtual machine management extension for Kubenetes. It does this by adding virtualization resources through the Kubernetes Custom Resource Definition API.
KubeVirt is a Cloud Native Computing Project (CNCF) incubating project.
The Oppia project aims to empower learners across the globe by providing access to high-quality, engaging education. We envision a world where access to high-quality education is not a privilege but a human right.
The team works on two platforms:
- (a) Oppia Web, which provides an online learning tool that enables anyone to learn from effective and engaging interactive lessons (called 'explorations'), which simulate a one-on-one conversation with a tutor. This format makes it possible for students to learn by doing while getting feedback. The Oppia Web platform also provides the infrastructure needed to support lesson creation and translation.
- (b) Oppia Android, which provides a way for these lessons to be played offline on an Android app that supports low-end devices and does not require Internet connectivity.
As a community, we are also aware that millions of students in underserved communities lack access to the educational resources necessary to effectively learn key skills like basic numeracy. Thus, in addition to developing the Oppia platform, the team has launched and continues to develop a set of free and effective lessons on basic mathematics, supplemented by translations and voiceovers. Students using these lessons have shown strong improvements between pre-and post-tests, and we’ve received lots of positive feedback on them. We are planning to extend this offering to other subjects, based on what students (and the nonprofits working with them) tell us would be most useful.
CircuitVerse is an easy to use digital logic circuit simulator which aims at providing a platform to create, share and learn digital circuits. It can run on almost any device without the need for installing any software. The platform has been designed for use by students, professionals and hobbyists alike. The vision is to develop a community around the platform that will aid students to self-learn digital logic design. The platform is currently used by several universities worldwide. Apart from the simulator, users can create, learn, collaborate and share their work. Teachers can create groups and host assignments on the platform. The platform’s impact has been more evident than ever in the Covid 19 pandemic as CircuitVerse enabled schools and colleges to move their courses online.
GNU Radio is a free & open-source software development toolkit that provides signal processing blocks to implement software radios. It can be used with readily-available low-cost external RF hardware to create software-defined radios, or without hardware in a simulation-like environment. It is widely used in research, industry, academia, government, and hobbyist environments to support both wireless communications research and real-world radio systems.
In brief, a software radio is a radio system which performs the required signal processing in software instead of using dedicated integrated circuits in hardware. The benefit is that since software can be easily replaced in the radio system, the same hardware can be used to create many kinds of radios for many different communications standards; thus, one software radio can be used for a variety of applications!
You can use GNU Radio to write applications to receive and transmit data with radio hardware, or to create entirely simulation-based applications. GNU Radio has filters, channel codes, synchronization elements, equalizers, demodulators, vocoders, decoders, and many other types of blocks which are typically found in signal processing systems. More importantly, it includes a method of connecting these blocks and then manages how data is passed from one block to another. Extending GNU Radio is also quite easy; if you find a specific block that is missing, you can quickly create and add it.
GNU Radio applications can be written in either C++ or Python, while the performance-critical signal processing path is implemented in C++ using processor floating-point extensions where available. This enables the developer to implement real-time, high-throughput radio systems in a simple-to-use, rapid-application-development environment.
ScummVM is a game preservation projects that celebrated its 20th anniversary last year. Originally focused on 2D Point&Click adventure games, its scope widened in 2016 to RPG thanks to a successful GSoC students, and to 3D games in 2020 after the merge with its sister project, ResidualVM. The purpose is only to replace the game executable, not to enhance or replace the game assets.
Swift is a general-purpose programming language built using a modern approach to safety, performance, and software design patterns.
The goal of the Swift project is to create the best available language for uses ranging from systems programming, to mobile and desktop apps, scaling up to cloud services. Most importantly, Swift is designed to make writing and maintaining correct programs easier for the developer.
SymPy is a Python library for symbolic mathematics. It aims to become a full-featured computer algebra system (CAS) while keeping the code as simple as possible in order to be comprehensible and easily extensible. SymPy is written entirely in Python.
MIT App Inventor
MIT App Inventor is a free, open source web app that anyone can use to build mobile apps. It has been used by over 8 million people worldwide who have built more than 30 million apps. It is available in twelve different languages and used by people from ages 13 and up.
The FreeBSD Project
FreeBSD is an advanced operating system for server, desktop, and embedded computer platforms. It offers advanced networking, impressive security features, and world class performance. It is used by some of the world's busiest web sites as well as embedded networking and storage devices. FreeBSD source code is the foundation for the PlayStation 5, Junos, the operating system used by Juniper routers to provide the backbone of the Internet, and was the starting point for the core of Apple's OS X (now MacOS). FreeBSD also powers the servers Netflix uses to stream terabits of video every second. It is possible that you are using FreeBSD right now without even realizing it.
The FreeBSD Project began nearly 30 years ago in 1993, but is based on work from the Berkeley Computer Systems Research Group dating back to 1978. Over those years the code base has gone through continuous development, improvement, and optimization. It offers a complete operating system from which students can choose to work on topics in the kernel or userland. In addition to producing an operating system, FreeBSD has incubated the development of software components used by other open source projects including bsnmp, jemalloc, libarchive, and OpenPAM.
FreeBSD has an active mentoring program to welcome new developers into our community. There are currently over 300 developers with write access to our repositories and many more contributors. Many of our past GSoC students have gone on to become FreeBSD developers. We communicate with one another in the many mailing lists, forums, blogs, IRC channels, and user groups all listed on our main website.
Organic Maps is the ultimate companion app for travelers, tourists, hikers, and cyclists:
- Detailed offline maps with places that don't exist on other maps, thanks to OpenStreetMap
- Cycling routes, hiking trails, and walking paths
- Contour lines, elevation profiles, peaks, and slopes
- Turn-by-turn walking, cycling, and car navigation with voice guidance
- Fast offline search on the map
- Bookmarks export and import in KML/KMZ formats (GPX is planned)
- Dark Mode to protect your eyes
- Countries and regions don't take a lot of space
- Free and open-source
Organic Maps is pure and organic, made with love:
- Respects your privacy
- Saves your battery
- No unexpected mobile data charges
The Mifos Initiative
We are a global 501(c)3 fintech non-profit leveraging the cloud, mobile & open source community to democratize financial services worldwide and digitally transform the world’s 3 billion poor and underbanked. Mifos has pioneered open source banking technology for the past fifteen years transforming the entire sector at each major stage of evolution from microfinance to financial inclusion to digital financial services and now embedded finance. Mifos guides the open source community, steers the roadmap, and stewards the vibrant ecosystem of organizations building solutions on its open platform. Our building blocks for banking, recognized as digital public goods, make core banking commoditized infrastructure, empowering any organization, anywhere to embed any financial service to any customer via any channel. These building blocks provide the common functionalities for creating customers, managing wallets, savings and loan accounts, orchestrating payments, and maintaining the financial ledger & reports. More than 20 million clients are reached by 500+ financial institutions across 41 countries using solutions powered by our APIs.
R project for statistical computing
R provides a wide variety of statistical and graphical techniques, and is highly extensible. R is often the tool of choice for research in statistical methodology.
R is an integrated suite of software facilities for data manipulation, calculation and graphical display. It includes
- an effective data handling and storage facility,
- a suite of operators for calculations on arrays, in particular matrices,
- a large, coherent, integrated collection of intermediate tools for data analysis,
- graphical facilities for data analysis and display either on-screen or on hardcopy, and
- a well-developed, simple and effective programming language which includes conditionals, loops, user-defined recursive functions and input and output facilities.
The term “environment” is intended to characterize it as a fully planned and coherent system, rather than an incremental accretion of very specific and inflexible tools, as is frequently the case with other data analysis software.
R, like S, is designed around a true computer language, and it allows users to add additional functionality by defining new functions. Much of the system is itself written in the R dialect of S, which makes it easy for users to follow the algorithmic choices made. For computationally-intensive tasks, C, C++ and Fortran code can be linked and called at run time. Advanced users can write C code to manipulate R objects directly.
Many users think of R as a statistics system. We prefer to think of it of an environment within which statistical techniques are implemented. R can be extended (easily) via packages. There are about eight packages supplied with the R distribution and many more are available through the CRAN family of Internet sites covering a very wide range of modern statistics.
R has its own LaTeX-like documentation format, which is used to supply comprehensive documentation, both on-line in a number of formats and in hardcopy.
Micro Electronics Research Lab - UITU
A non-profit organization fostering research on IoT, AI and ML based architectures leveraging the open-source RISC-V ISA. The main purpose of this Lab is to train undergraduate students for future Silicon industry.
TensorFlow is an open-source machine learning framework for everyone. Originally developed by Google, TensorFlow now has >1,000 contributors from all around the world. TensorFlow supports multiple programming languages and applications, including Python, mobile, and web.
NumFOCUS supports and promotes world-class, innovative, open source scientific software. Most individual projects, even the wildly successful ones, find the overhead of a non-profit to be too large for their community to bear. NumFOCUS provides a critical service as an umbrella organization for this projects.
GNU Image Manipulation Program
GIMP is a cross-platform image editor available for GNU/Linux, macOS, Windows and more operating systems. It is free software, you can change its source code and distribute your changes.
Whether you are a graphic designer, photographer, illustrator, or scientist, GIMP provides you with sophisticated tools to get your job done. You can further enhance your productivity with GIMP thanks to many customization options and 3rd party plugins.
CRIU (stands for Checkpoint/Restore In Userspace), is a Linux software. It can freeze a running container (or an individual application) and checkpoint its state to disk. The data saved can be used to restore the application and run it exactly as it was during the time of the freeze. Using this functionality, application or container live migration, snapshots, remote debugging, and many other things are now possible. CRIU is packaged for all leading Linux distributions and it is integrated wit lots of popular projects such as Docker, Podman, LXC/LXD, OpenVZ, runc, open-mpi and others
XMPP Standards Foundation
The XMPP Standards Foundation (XSF) is an independent, nonprofit standards development organization with the mission to build an open, secure, feature-rich, federated infrastructure for real-time communication and collaboration over the Internet.
We seek to achieve that goal by developing the world’s best open protocols for instant messaging, presence, and other forms of near-real-time communication, based on the IETF’s Extensible Messaging and Presence Protocol (XMPP). By “best” we mean simplest, most extensible, most powerful, most secure. Moreover, we value freedom, openness, and good technical design.
PostgreSQL is a powerful, open source object-relational database system with over 30 years of active development that has earned it a strong reputation for reliability, feature robustness, and performance.
This project is for programmers needing a working SSH implementation by the mean of a library. The complete control of the client is made by the programmer. With libssh, you can remotely execute programs, transfer files, use a secure and transparent tunnel for your remote programs. With its Secure FTP implementation, you can play with remote files easily, without third-party programs others than libcrypto (from openssl), libgcrypt or mbedTLS.
LLVM Compiler Infrastructure
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Despite its name, LLVM has little to do with traditional virtual machines. The name LLVM itself is not an acronym; it is the full name of the project.
LLVM began as a research project at the University of Illinois, with the goal of providing a modern, SSA-based compilation strategy capable of supporting both static and dynamic compilation of arbitrary programming languages. Since then, LLVM has grown to be an umbrella project consisting of a number of subprojects, many of which are being used in production by a wide variety of commercial and open source projects as well as being widely used in academic research.
In addition to official subprojects of LLVM, there are a broad variety of other projects that use components of LLVM for various tasks. Through these external projects you can use LLVM to compile Ruby, Python, Haskell, Rust, D, PHP, Pure, Lua, Julia, and a number of other languages. A major strength of LLVM is its versatility, flexibility, and reusability, which is why it is being used for such a wide variety of different tasks: everything from doing light-weight JIT compiles of embedded languages like Lua to compiling Fortran code for massive super computers.
The NetBSD Foundation
NetBSD is a free, fast, secure, and highly portable Unix-like Open Source operating system. It is available for a wide range of platforms, from large-scale servers and powerful desktop systems to handheld and embedded devices. Its clean design and advanced features make it excellent for use in both production and research environments, and the source code is freely available under a business-friendly license. NetBSD is developed and supported by a large and vivid international community. Many applications are readily available through pkgsrc, the NetBSD Packages Collection.
ArduPilot is the world's most widely used open source flight code software for unmanned autonomous vehicles including planes, multicopters, helicopters, cars, boats, submarines, blimps, antenna trackers and much more.
Written primarily in C++, ArduPilot supports over 70 different types of autopilot hardware including the well known Pixhawk autopilot.
Our team motto, Versatile, Trusted, Open reflects our team's aim to provide high quality autopilot software that reliably supports a huge variety of frames, sensors and use cases. The software is open but so is the team, always welcoming of new contributors whether they be software developers, wiki documentors, testers or users.
OpenMRS provides the foundational, electronic medical record technology for more than 6,500 health facilities in over 40 countries, touching and helping millions of patients throughout the world.
The OpenMRS Community’s mission is to improve healthcare delivery in resource-constrained environments.
We coordinate a global community that creates and sustains a robust, scalable, user-driven and open-source medical record platform and reference frontend application.
We maintain a platform that countries and implementers use to create a customized EMR system in response to actual needs on the ground.
OpenCV promotes the beneficial uses of computer vision in society primarily through maintaining a state-of-the-art open and free code base and via courseware, contests, workshops and conferences.
OpenCV, the Open Source Computer Vision Library includes state of the art computer vision and deep learning algorithms (including running deep networks) and apps. It is professionally coded and optimized. It can be used in C++, Python, javascipt, Julia, Cuda, OpenCL and Matlab. It runs on: Android, iOS, Windows, Linux and MacOS and many embedded implementations such as Raspberry Pi, Movidius, and RISC-V.
Our community mainly works on four development areas: 1.- Education in Robotics * RoboticsAcademy (https://jderobot.github.io/RoboticsAcademy/): a ROS-based framework to learn robotics and computer vision with drones, autonomous cars.... It is a collection of Python programmed exercises for engineering students. * Unibotics: a web based framework for teaching robotics.
2.- Robot Programming Tools * VisualCircuit (https://jderobot.github.io/VisualCircuit/) for robot programming with connected blocks, as in electronic circuits, in a visual way * VisualStates for robot programming with Finite State Machines in a visual way * WebSim2D robot simulator with web technologies
3.- MachineLearning in Robotics * DeepLearningSuite: neural networks for robot control. It includes the BehaviorMetrics tool for assessment of neural networks for autonomous driving * RL-Studio: Robotic library for the training of Reinforcement Learning algorithms * DetectionMetrics tool for evaluation of visual detection neural networks and algorithms
4.- FPGAs in Robotics * FPGA-robotics (https://github.com/JdeRobot/FPGA-robotics): programming robots with reconfigurable computing (FPGAs) using open tools as IceStudio and Symbiflow. Verilog-based reusable blocks for robotics applications. * NeuralFPGA: running deeplearning networks on FPGAs
GNU Octave is a high-level interpreted language, primarily intended for numerical computations. It provides capabilities for the numerical solution of linear and nonlinear problems and for performing other numerical experiments. It also provides extensive graphics capabilities for data visualization and manipulation. Octave is normally used through its interactive command line interface, but it can also be used to write non-interactive programs. The Octave language is quite similar to Matlab so that most programs are easily portable.
Octave is continually being upgraded. Student projects may also involve developing or upgrading Octave Forge packages, which can be loaded to provide additional specialized functions that supplement those provided in Core Octave.
CERN-HSF (High-Energy Physics Software Foundation) is the umbrella organization for high-energy physics-related projects in GSoC. The HEP Software Foundation (http://hepsoftwarefoundation.org/) facilitates the coordination of common international efforts in high-energy physics software and computing.
CERN (European Organization for Nuclear Research, https://home.cern) has participated in GSoC since 2011 as the CERN-SFT group, which provides common software for CERN's experiments. In 2017, the program expanded to include many software projects from the whole field of high-energy physics. The vast majority of our GSoC projects do not require any physics knowledge.
The experiments at CERN, such as the Large Hadron Collider, the world’s largest and most powerful particle accelerator (http://home.cern/topics/large-hadron-collider) try to answer fundamental questions about the Universe. For example, what is the nature of mass? What are the elementary building blocks of the Universe? What was the early Universe like? What is the nature of dark matter and dark energy? Why is there an asymmetry between matter and antimatter? In 2012, LHC experiments announced the discovery of a new particle, the Higgs Boson, that helps explain how particles obtain mass. Also, CERN is the birthplace of the World Wide Web. Today, particle physicists are working on analyzing the data from the experiments to study the properties of the newly discovered particle and to search for new physics, such as dark matter or extra dimensions. This requires a lot of sophisticated software.
The open-source high-energy physics projects to which students can contribute during GSoC span many high-energy physics software projects: data analysis, detector and accelerator simulation, event reconstruction, data management and many others. We look forward to your contributions!
Software and Computational Systems Lab at LMU Munich
The Software and Computational Systems Lab is a research group at LMU Munich, one of the leading universities in Germany. We focus on the development of new models and algorithms to efficiently analyze software systems and emphasize on tool implementations of the theoretical concepts. For GSoC 2023, we propose ideas involving several aspects of software analysis, from implementing and benchmarking verification algorithms to incorporating backend logic solvers.
The following list contains some projects that are suitable for GSoC. They are all open-source projects developed by our group.
<1> CPAchecker (https://cpachecker.sosy-lab.org) is an award-winning (https://cpachecker.sosy-lab.org/achieve.php) framework for formal verification of C programs. It is written in Java and based on a highly modular architecture that allows one to develop and combine a wide range of different analyses. CPAchecker has helped to find hundreds of bugs in the Linux kernel.
<2> JavaSMT (https://github.com/sosy-lab/java-smt) is a common API layer for accessing various SMT solvers from Java programs without having to use solver-specific APIs and care about subtle details. It supports the majority of common SMT solvers and is used by a wide range of projects.
<3> PJBDD (https://gitlab.com/sosy-lab/software/paralleljbdd) is a multi-threaded Binary Decision Diagram (BDD) library written in Java. It supports concurrent computation, parallel operations, and automated resource management.
<4> BenchExec (https://github.com/sosy-lab/benchexec) is a benchmarking framework (written in Python) that uses modern Linux features such as cgroups and namespaces to create benchmarking containers and measure their resource usage.
For a more detailed description and some prospective GSoC topics, please visit our webpage (https://www.sosy-lab.org/gsoc/gsoc2023.php#ideas). You are also welcome to suggest your own ideas!
Submitty is an open source programming assignment submission system with secure and automated testing and grading, efficient manual TA/instructor grading, and additional tools for overall course management and communication between students and instructional staff. Submitty was launched by the Rensselaer Center for Open Source Software (RCOS) in 2014.
- Secure testing of many programming languages: Python, C/C++, Java, etc.
- Customizable automated grading with immediate feedback to students, and optional hidden or randomized tests.
- Advanced grading tools: static analysis, JUnit, code coverage, memory debuggers, networked assignments, custom Docker containers, and screenshots/GIFs of graphics programs.
- Individual or team assignments submittyed by drag-and-drop or version control.
- Correct mistakes through multiple submissions, flexible late day’’ policy
- Interface for complementary instructor/TA manual grading, regrade requests, anonymized peer grading.
- Instructor bulk upload of scanned .pdf exams, QR code name matching, pdf annotation.
- Supports course material hosting, term grades spreadsheet, plagiarism detection.
- Integrated discussion forum, email announcements, lecture polling, office hours queue, and student activity dashboard.
- Scales to multiple courses, thousands of students, multiple instructors and TAs per course
- Open-source, free to use, install on your own hardware, or VPS
Submitty has been used at a half dozen other universities and we aim to grow to more users and developers. The courses using Submitty cover the undergraduate and graduate curriculum from introductory programming courses, intermediate and advanced theory courses, popular junior/senior electives with team projects and written reports, and specialized graduate courses.
We regularly present our work at the annual ACM SIGCSE conference.
OSGeo (Open Source Geospatial Foundation)
The Open Source Geospatial Foundation (OSGeo) is a not-for-profit organization whose mission is to foster global adoption of open geospatial technology by being an inclusive software foundation devoted to an open philosophy and participatory community-driven development.
OSGeo serves as an umbrella organization for the Open Source Geospatial community in general and several code projects in particular:
Web Mapping: deegree, geomajas, GeoMOOSE, GeoServer, Mapbender, MapFish, MapGuide Open Source, MapServer, OpenLayers.
Desktop Applications: GRASS GIS, gvSIG, Marble, QGIS.
Geospatial Libraries: FDO, GDAL/OGR, GEOS, GeoTools, OSSIM, PostGIS.
Metadata Catalogues: GeoNetwork, pycsw.
Content Management Systems: GeoNode.
Community Projects: pgRouting, istSOS, MetaCRS, Opticks, Orfeo ToolBox (OTB), PyWPS, Team Engine, ZOO-Project.
Other (non-code) Projects: GeoForAll (Education and Curriculum), OSGeoLive DVD, Public Geospatial Data.
We host regional and international FOSS4G conferences with typical attendance of 500-1100+ geospatial developers, industry and government leaders, and researchers. Our mailing lists collectively go out to ~ 30,000 unique subscribers.
DBpedia is a crowd-sourced community effort to extract structured information from Wikipedia and make this information available on the Web. It allows for a global and unified access to Knowledge Graphs.
GCP scanner is a tool that can help determine what level of access certain credentials possess on GCP. The scanner is designed to help security engineers with evaluating the impact of a certain VM/container compromise, GCP service account or OAuth2 token key leak by enumerating accessible GCP services. This can help to understand what an actual attacker had access to by compromising a certain identity.