List of Apache Software Foundation projects

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

This list of Apache Software Foundation projects contains the software development projects of the Apache Software Foundation (ASF).

Besides the projects, there are a few other distinct areas of Apache:

  • Incubator: for aspiring ASF projects
  • Attic: for retired ASF projects
  • Labs: a place for innovation where committees of the foundation can experiment with new ideas
  • INFRA - Apache Infrastructure Team: provides and manages all infrastructure and services for the Apache Software Foundation, and for each project at the Foundation

Active Projects[edit]

  • Accumulo: secure implementation of Bigtable
  • ActiveMQ: message broker supporting different communication protocols and clients, including a full Java Message Service (JMS) 1.1 client.
  • Airavata: Apache Airavata is a distributed system software framework to manage simple to composite applications with complex execution and workflow patterns on diverse computational resources.
  • Allura: Python-based an open source implementation of a software forge
  • Ambari: Apache Ambari makes Hadoop cluster provisioning, managing, and monitoring dead simple.
  • Ant: Java-based build tool
    • AntUnit: The Ant Library provides Ant tasks for testing Ant task, it can also be used to drive functional and integration tests of arbitrary applications with Ant
    • Ivy: Apache Ivy is a very powerful dependency manager oriented toward Java dependency management, even though it could be used to manage dependencies of any kind
    • IvyDE: Integrate Ivy in Eclipse with the IvyDE plugin
  • Any23: Anything To Triples (Any23) is a library, a web service and a command line tool that extracts structured data in RDF format from a variety of Web documents.
  • Apex: Enterprise-grade unified stream and batch processing engine.
  • APR: Apache Portable Runtime, a portability library written in C
  • Archiva: Build Artifact Repository Manager
  • Aries: OSGi Enterprise Programming Model
  • Arrow: "A high-performance cross-system data layer for columnar in-memory analytics".[1][2]
  • AsterixDB: Open source Big Data Management System
  • Atlas: Scalable and extensible set of core foundational governance services
  • Aurora: Mesos framework for long-running services and cron jobs
  • Avro: Apache Avro is a data serialization system.
  • Apache Axis Committee
    • Axis: open source, XML based Web service framework
    • Axis2: Apache Axis2 is a service hosting and consumption framework that makes it easy to use SOAP and Web Services
    • Rampart: implementation of the WS-Security standard for the Axis2 Web services engine
    • Sandesha2: Apache Sandesha2 is an Axis2 module implementing WS-RM.
  • Bahir: Extensions to distributed analytic platforms such as Apache Spark
  • Beam, an uber-API for big data
  • Bigtop: Bigtop is a project for the development of packaging and tests of the Apache Hadoop ecosystem.
  • Bloodhound: defect tracker based on Trac[3]
  • BookKeeper: A reliable replicated log service
  • Brooklyn: Apache Brooklyn is a framework for modeling, monitoring, and managing applications through autonomic blueprints.
  • Buildr: Apache Buildr is a build system for Java-based applications, including support for Scala, Groovy and a growing number of JVM languages and tools.
  • BVal: Bean Validation API Implementation
  • Calcite: dynamic data management framework
  • Camel: declarative routing and mediation rules engine which implements the Enterprise Integration Patterns using a Java-based domain specific language
  • CarbonData: Apache CarbonData is an indexed columnar data format for fast analytics on big data platform, e.g. Apache Hadoop, Apache Spark, etc.
  • Cassandra (2008): highly scalable second-generation distributed database
  • Cayenne: Java ORM framework
  • Celix: Implementation of the OSGi specification adapted to C and C++.
  • Chemistry: Apache Chemistry provides open source implementations of the Content Management Interoperability Services (CMIS) specification.
  • Chukwa: Chukwa is an open source data collection system for monitoring large distributed systems.
  • Clerezza: Clerezza is a service platform which provides a set of functionality for management of semantically linked data accessible through RESTful Web Services and in a secured way
  • CloudStack: software to deploy and manage cloud infrastructure
  • Cocoon: XML publishing framework
  • Commons: Reusable Java libraries and utilities too small to merit their own project
    • BCEL: Bytecode Engineering Library
    • Daemon: Commons Daemon
    • Jelly: Jelly is a Java and XML based scripting engine. Jelly combines the best ideas from JSTL, Velocity, DVSL, Ant and Cocoon all together in a simple yet powerful scripting engine.
    • Logging: Commons Logging is a thin adapter allowing configurable bridging to other, well known logging systems
    • OGNL: Object Graph Navigation Library
  • Cordova: mobile development framework
  • CouchDB: Document-oriented database
  • Apache Creadur Committee
    • Rat: Apache Rat improves accuracy and efficiency when reviewing and auditing releases.
    • Tentacles: Apache Tentacles simplifies the job of reviewing repository releases consisting of large numbers of artifacts.
    • Whisker: Apache Whisker assists assembled applications to maintain correct legal documentation.
  • Crunch: Provides a framework for writing, testing, and running MapReduce pipelines
  • cTAKES: clinical "Text Analysis Knowledge Extraction Software" to extract information from electronic medical record clinical free-text
  • Curator: Apache Curator builds on ZooKeeper and handles the complexity of managing connections to the ZooKeeper cluster and retrying operations.
  • CXF: web services framework
  • DataFu: Apache DataFu is a collection of libraries for working with large-scale data in Hadoop.
  • Apache DB Committee
  • DeltaSpike: DeltaSpike is a collection of portable Extensions for CDI Containers
  • Apache Directory Committee
    • Directory: LDAP and Kerberos, entirely in Java
    • Directory Server: An extensible, embeddable LDAP and Kerberos server, entirely in Java
    • Directory Studio: The Eclipse based LDAP browser and directory client
    • Fortress: A standards-based access management system, written in Java.
    • Kerby: Kerberos binding in Java
    • LDAP API: An SDK for directory access in Java.
  • DRAT: large scale code license analysis, auditing and reporting
  • Drill: software framework that supports data-intensive distributed applications for interactive analysis of large-scale datasets
  • Eagle: open source analytics solution for identifying security and performance issues instantly on big data platforms
  • Empire-db: Apache Empire-db is a lightweight relational database abstraction layer and data persistence component
  • Falcon: Data governance engine
  • Felix: Implementation of the OSGi Release 5 core framework specification
  • Fineract: Platform for Digital Financial Services
  • Flex: cross-platform SDK for developing and deploying rich Internet applications.
  • Flink: Fast and reliable large-scale data processing engine.
  • Flume: large scale log aggregation framework
  • Apache Fluo Committee
    • Fluo: Apache Fluo is a distributed processing system that lets users make incremental updates to large data sets.
    • Fluo Recipes: Apache Fluo Recipes build on the Fluo API to offer additional functionality to developers.
    • Fluo YARN: Apache Fluo YARN is a tool for running Apache Fluo applications in Apache Hadoop YARN.
  • Forrest: documentation framework based upon Cocoon
  • FreeMarker: FreeMarker is a template engine, i.e. a generic tool to generate text output based on templates. FreeMarker is implemented in Java as a class library for programmers.
  • Geode: Low latency, high concurrency data management solutions
  • Geronimo: Java EE server
  • Giraph: scalable Graph Processing System
  • Gora: The Apache Gora open source framework provides an in-memory data model and persistence for big data.
  • Groovy: an object-oriented, dynamic programming language for the Java platform
  • Guacamole: HTML5 web application for accessing remote desktops
  • Gump: integration, dependencies, and versioning management
  • Hadoop: Java software framework that supports data intensive distributed applications
  • Hama: Hama is an efficient and scalable general-purpose BSP computing engine.
  • HAWQ: Apache HAWQ is Apache Hadoop Native SQL. Advanced Analytics MPP Database for Enterprises.
  • HBase: Apache HBase software is the Hadoop database. Think of it as a distributed, scalable, big data store.
  • Helix: A cluster management framework for partitioned and replicated distributed resources
  • Hive: The Apache Hive data warehouse software facilitates querying and managing large datasets residing in distributed storage.
  • HTTP Server: The Apache HTTP Server application 'httpd'
  • Apache HttpComponents: Low-level Java libraries for HTTP
  • Ignite: Apache Ignite is an In-Memory Data Fabric providing in-memory data caching, partitioning, processing, and querying components.
  • Impala: A high-performance distributed SQL engine
  • Isis: Apache Isis is a framework for rapidly developing domain-driven apps in Java
  • Jackrabbit: implementation of the Java Content Repository API
  • JAMES: Java email and news server
  • jclouds: open source multi-cloud toolkit for the Java platform
  • Jena is an open source Semantic Web framework for Java
  • JMeter: Pure Java application for load and functional testing
  • Johnzon: JSR-353 compliant JSON parsing; modules to help with JSR-353 as well as JSR-374 and JSR-367
  • JSPWiki: A feature-rich and extensible WikiWiki engine built around the standard J2EE components (Java, servlets, JSP).
  • Apache jUDDI Committee
    • Scout: Apache Scout is an implementation of the JSR 93 (JAXR).
  • Juneau: A toolkit for marshalling POJOs to a wide variety of content types using a common framework
  • Kafka (2011): Message broker software
  • Karaf: Apache Karaf is an OSGi distribution for server-side applications.
  • Kibble: Apache Kibble is a suite of tools for collecting, aggregating and visualizing activity in software projects.
  • Knox: A REST API Gateway for Hadoop Services
  • Kudu: A distributed columnar storage engine built for the Apache Hadoop ecosystem
  • Kylin: distributed analytics engine
  • Lens: Unified Analytics Interface
  • Libcloud: Apache Libcloud is a standard Python library that abstracts away differences among multiple cloud provider APIs.
  • Apache Logging Services Committee
    • Chainsaw: Apache Chainsaw is a GUI log viewer.
    • Log4cxx: Apache log4cxx provides logging services for C++.
    • Log4j: Apache Log4j
    • Log4net: Apache log4net provides logging services for .NET.
    • Log4php: Apache log4php is a logging framework for PHP.
  • Apache Lucene Committee
    • Lucene Core: A high-performance, full-featured text search engine library
    • Solr: enterprise search server based on the Lucene Java search library
  • Lucene.Net: Lucene.Net is a port of the Lucene search engine library, written in C# and targeted at .NET runtime users.
  • Lucy: The Apache Lucy search engine library provides full-text search for dynamic programming languages.
  • MADlib: Scalable, Big Data, SQL-driven machine learning framework for Data Scientists
  • Mahout: machine learning and data mining solution. Mahout
  • ManifoldCF: Open-source software for transferring content between repositories or search indexes
  • Marmotta: An Open Platform for Linked Data
  • Maven: Java project management and comprehension tool
    • Doxia: Doxia is a content generation framework, which supports many markup languages.
  • Mesos: open-source cluster manager
  • MetaModel: Providing a common interface for discovery, exploration of metadata and querying of different types of data sources.
  • Metron: Real-time big data security
  • Apache MINA Committee
    • FtpServer: FTP server written entirely in Java
    • MINA: Multipurpose Infrastructure for Network Application, a framework to develop high performance and high scalability network applications. MINA
    • SSHD: Apache SSHD is a 100% pure java library to support the SSH protocols on both the client and server side.
    • Vysper: Apache Vysper aims to be a modular, full featured XMPP (Jabber) server. Vysper is implemented in Java.
  • Mnemonic: a transparent nonvolatile hybrid memory oriented library for Big data, High-performance computing, and Analytics
  • Apache MyFaces Committee
  • Mynewt: embedded OS optimized for networking and built for remote management of constrained devices
  • NiFi: Easy to use, powerful, and reliable system to process and distribute data
  • Nutch: a highly extensible and scalable open source web crawler
  • ODE: Apache ODE is a WS-BPEL implementation that supports web services orchestration using flexible process definitions.
  • OFBiz: Open for Business: enterprise automation software
  • Olingo: Client and Server for OData
  • OODT: Object Oriented Data Technology, a data management framework for capturing and sharing data
  • Oozie: Oozie is a workflow scheduler system to manage Apache Hadoop jobs.
  • Open Climate Workbench: A comprehensive suite of algorithms, libraries, and interfaces designed to standardize and streamline the process of interacting with large quantities of observational data and conducting regional climate model evaluations.
  • OpenJPA: Java Persistence API Implementation
  • OpenMeetings: Video conferencing, instant messaging, white board and collaborative document editing application.
  • OpenNLP: natural language processing toolkit.
  • OpenOffice: An open-source, office-document productivity suite
  • OpenWebBeans: Dependency Injection Platform
  • ORC: A columnar file format for big data workloads
  • Parquet: A general-purpose columnar storage format
  • PDFBox: Java based PDF library (reading, text extraction, manipulation, viewer)
  • Mod_perl: module that integrates the Perl interpreter into Apache server
  • Phoenix: SQL layer on HBase
  • Pig: A platform for analyzing large data sets on Hadoop
  • Pivot: A platform for building rich internet applications in Java
  • POI: Poor Obfuscation Implementation, a library for reading and writing Microsoft Office formats
  • Polygene: Exploration of Composite Oriented Programming for domain-centric application development
  • Portals: web portal related software
  • PredictionIO: PredictionIO is an open source Machine Learning Server built on top of state-of-the-art open source stack, that enables developers to manage and deploy production-ready predictive services for various kinds of machine learning tasks.
  • Pulsar: Distributed pub-sub messaging system originally created at Yahoo
  • Qpid: AMQP messaging system in Java and C++
  • Ranger: A framework to enable, monitor and manage comprehensive data security across the Hadoop platform.
  • REEF: A scale-out computing fabric that eases the development of Big Data applications on top of resource managers such as Apache YARN and Mesos.
  • River: Apache River software provides a standards-compilani JINI service.
  • RocketMQ: A fast, low latency, reliable, scalable, distributed, easy to use message-oriented middleware, especially for processing large amounts of streaming data
  • Roller: Apache Roller is a full-featured, multi-user and group blog server suitable for both small and large blog sites.
  • Royale: Improving developer productivity in creating applications for wherever Javascript runs (and other runtimes)
  • Samza: Stream Processing Framework
  • Santuario: XML Security in Java and C++
  • Sentry: Fine grained authorization to data and metadata in Apache Hadoop
  • Serf: High performance C-based HTTP client library built upon the Apache Portable Runtime (APR) library.
  • ServiceMix: enterprise service bus that supports JBI and OSGi
  • Shiro: A simple to use Java Security Framework
  • Spatial Information System (SIS): A library for developing geospatial applications
  • Sling: Innovative Web framework based on JCR and OSGi
  • SpamAssassin: email filter used to identify spam
  • Spark: open source cluster computing framework
  • Sqoop: A tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases.
  • Stanbol: Software components for semantic content management
  • Steve: Apache STeVe is a collection of online voting tools, used by the ASF, to handle STV and other voting methods.
  • Storm: Apache Storm is a distributed real-time computation system.
  • Streams: Interoperability of online profiles and activity feeds
  • Struts: Java web applications framework
  • Subversion: open source version control (client/server) system
  • Synapse: A lightweight and high-performance Enterprise Service Bus (ESB)
  • Syncope: Apache Syncope is an Open Source system for managing digital identities in enterprise environments.
  • SystemML: Scalable machine learning
  • Tajo: Relational data warehousing system. It using the hadoop file system as distributed storage.
  • Tapestry: Component-based Java web framework
  • Apache Tcl Committee
    • Tcl integration for Apache httpd
    • Rivet: Server-side Tcl programming system combining ease of use and power
    • Websh: Websh is a rapid development environment for building powerful, fast, and reliable web applications in Tcl
  • Tez: Apache Tez is an effort to develop a generic application framework which can be used to process arbitrarily complex directed-acyclic graphs (DAGs) of data-processing tasks and also a re-usable set of data-processing primitives which can be used by other projects.
  • Thrift : Interface definition language and binary communication protocol that is used to define and create services for numerous languages.
  • Tika: content analysis toolkit for extracting metadata and text from digital documents of various types, e.g., audio, video, image, office suite, web, mail, and binary
  • Tiles: Apache Tiles is a templating framework built to simplify the development of web application user interfaces.
  • TinkerPop: A graph computing framework for both graph databases (OLTP) and graph analytic systems (OLAP)
  • Tomcat: web container for serving servlets and JSP
  • TomEE: Apache TomEE is an all-Apache Java EE 6 Web Profile stack for Apache Tomcat
  • Traffic Control: Built around Apache Traffic Server as the caching software, Traffic Control implements all the core functions of a modern CDN. Traffic Control
  • Traffic Server: HTTP/1.1 compliant caching proxy server. Traffic Server
  • Trafodion: Webscale SQL-on-Hadoop solution enabling transactional or operational workloads on Apache Hadoop.
  • Turbine: Turbine is a servlet based framework that allows Java developers to quickly build web applications.
  • Twill: Use Apache Hadoop YARN's distributed capabilities with a programming model that is similar to running threads
  • UIMA: unstructured content analytics framework
  • Usergrid: Usergrid is an open-source Backend-as-a-Service ("BaaS" or "mBaaS") composed of an integrated distributed NoSQL database, application layer and client tier with SDKs for developers looking to rapidly build web and/or mobile applications.
  • VCL: A cloud computing platform for provisioning and brokering access to dedicated remote compute resources.
  • Apache Velocity Committee:
    • Anakia: Anakia is an XML transformation tool which uses JDOM and Velocity to transform XML documents into the format of your choice.
    • Texen: Texen is a general purpose text generating utility based on Apache Velocity and Apache Ant.
    • Velocity: Java template creation engine
    • Apache Velocity DVSL: A tool modeled after XSLT and intended for general XML transformations using the Velocity Template Language.
    • Apache Velocity Tools: Tools and infrastructure for the template engine
  • VXQuery: Apache VXQuery implements a parallel XML Query processor.
  • Apache Web Services Committee
    • Axiom: Apache Axiom is an XML object model supporting deferred parsing.
    • Woden: Apache Woden is used to develop a Java class library for reading, manipulating, creating and writing WSDL documents.
  • Whimsy: Tools that display and visualize various bits of data related to ASF organizations and processes.
  • Wicket: Component-based Java web framework
  • Xalan: XSLT processors in Java and C++
  • Xerces: validating XML parser
  • Apache XML Graphics Committee
    • Batik: pure Java library for SVG content manipulation
    • FOP: Java print formatter driven by XSL formatting objects (XSL-FO); supported output formats include PDF, PS, PCL, AFP, XML (area tree representation), Print, AWT and PNG, and to a lesser extent, RTF and TXT
    • XML Graphics Commons: Common components for Apache Batik and Apache FOP
  • Yetus: A collection of libraries and tools that enable contribution and release processes for software projects.
  • Zeppelin: Zeppelin is a collaborative data analytics and visualization tool for distributed, general-purpose data processing systems
  • ZooKeeper: Coordination service for distributed applications

Incubating projects[edit]

The above is incomplete, as the list of incubating project changes frequently.

Retired Projects[edit]

A retired project is one which has been closed down on the initiative of the board, the project its PMC, the PPMC or the IPMC for various reasons. It is no longer developed at the Apache Software Foundation and does not have any other duties.

References[edit]

  1. ^ "Apache Arrow". Apache Software Foundation. Retrieved 12 May 2016.
  2. ^ "The Apache Software Foundation Announces Apache Arrow as a Top-Level Project". Apache Software Foundation. Retrieved 12 May 2016.
  3. ^ "Bloodhound Project Incubation Status". Apache Software Foundation. Retrieved 21 March 2013.

"Apache Project List". The Apache Software Foundation. 2018. Retrieved 2018-05-19.