Per Ottosson is CEO of Artificial Solutions, the Sweden-based maker of Teneo software used for building custom chatbots, virtual assistants, and conversational interfaces in over 80 languages. He joined the company in 2020 after a decade in leadership roles at IPSoft and two decades in enterprise software.
ContinualAI’s Avalanche is an open source library based on PyTorch, used for fast prototyping, training, and reproducible evaluation of continual learning algorithms.
Every day there are more and more start-ups that use AI or build around it. AI has been around since the 1950s, and since 2012, with the introduction of AlexNet and deep learning, it has become the hottest topic in data science. Artificial intelligence courses, books, and programs for entrepreneurs.
Ideas about what constitutes “beauty” are complex, subjective, and by no means limited to physical appearances. Elusive though it is, everyone wants more of it. That means big business and, increasingly, people harnessing algorithms to create their ideal selves in the digital and, sometimes, physical worlds. In this episode, we explore the popularity of beauty filters, and sit down with someone who’s convinced his software will show you just how to nip and tuck your way to a better life.
These days, data distributed training is all the rage. In data distributed training, learning is performed on multiple workers in parallel.
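As a rough illustration of learning on multiple workers in parallel, the sketch below simulates synchronous data-parallel SGD in a single process. The model (`y = w * x`), the toy dataset, and the function names are all hypothetical, and a real system would run each worker on its own device and average gradients over the network.

```python
# Synchronous data-parallel SGD, simulated in one process.
# Each "worker" holds one shard of the data and computes a local gradient;
# the gradients are averaged before a single shared weight update.

def local_gradient(w, shard):
    """Gradient of mean squared error for the model y = w * x on one shard."""
    return sum(2.0 * (w * x - y) * x for x, y in shard) / len(shard)

def train_data_parallel(data, num_workers=4, lr=0.01, steps=200):
    # Split the dataset into shards, one per simulated worker
    # (the shards are equal in size for the toy data below).
    shards = [data[i::num_workers] for i in range(num_workers)]
    w = 0.0
    for _ in range(steps):
        # In a real system these gradients are computed concurrently.
        grads = [local_gradient(w, shard) for shard in shards]
        # Stand-in for an all-reduce: average the gradients, then update.
        w -= lr * sum(grads) / len(grads)
    return w

# Toy data generated from y = 3x (hypothetical example values).
data = [(float(x), 3.0 * x) for x in range(1, 9)]
w = train_data_parallel(data)
```

Because every shard has the same size here, the averaged gradient equals the full-batch gradient, so the learned `w` should approach 3.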
DeepMind’s AlphaGo and its successors demonstrated that a policy and a heuristic function, combined in the PUCT (Polynomial Upper Confidence Trees) search algorithm, can be quite effective for guiding search in adversarial games. However, PUCT can be computationally inefficient and lacks guarantees on its search effort. Other methods such as LevinTS provide guarantees on the number of search steps, but they do not use a heuristic function.
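To make the role of the policy in PUCT concrete, here is a minimal sketch of a PUCT-style selection rule. The formula is the form commonly associated with AlphaGo Zero, which is an assumption here since the text above gives no formula; the function names, the `c_puct` constant, and the example statistics are hypothetical.

```python
import math

def puct_score(q, prior, parent_visits, child_visits, c_puct=1.0):
    """Mean value plus an exploration bonus scaled by the policy prior."""
    return q + c_puct * prior * math.sqrt(parent_visits) / (1 + child_visits)

def select_child(children, c_puct=1.0):
    """Return the action with the highest PUCT score.

    `children` maps action -> dict with keys "q" (mean value),
    "prior" (policy probability), and "visits" (visit count).
    """
    parent_visits = sum(ch["visits"] for ch in children.values())
    return max(
        children,
        key=lambda a: puct_score(
            children[a]["q"], children[a]["prior"],
            parent_visits, children[a]["visits"], c_puct,
        ),
    )

# Hypothetical node statistics for three candidate actions.
children = {
    "a": {"q": 0.50, "prior": 0.2, "visits": 10},
    "b": {"q": 0.40, "prior": 0.7, "visits": 2},
    "c": {"q": 0.10, "prior": 0.1, "visits": 0},
}
best = select_child(children)
```

In this example the rule picks action "b": its mean value is lower than that of "a", but its high prior and low visit count give it the largest exploration bonus.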
When will we have artificial general intelligence, the kind of AI that can mimic the human mind in all aspects? Experts are divided on the topic, and answers range anywhere between a few decades and never. But what everyone agrees on is that current AI systems are a far cry from human intelligence.
DevOps and CI/CD have been buzzwords for a while now, and they have proven their value in today’s fast-moving world and Agile development processes. One understands their true value only after being part of the process and seeing firsthand the immense amount of time and headache it saves.
To help you understand this programming language better, we have compiled a list of the top books for beginners and advanced programmers. All of these books have been recommended by the Python community.
“…developing and deploying ML systems is relatively fast and cheap, but maintaining them over time is difficult and expensive.” – D. Sculley et al., “Hidden Technical Debt in Machine Learning Systems.” Every data scientist can relate to this quote.
Open source artificial intelligence projects don’t always get a lot of publicity, but they play a vital role in the development of artificial intelligence. Because these open source projects are often pursued as passion projects by developers (sometimes in colleges and universities), the advances are creative and particularly forward-looking.
Typically freed from the constraints of a corporate setting (though some are supported by companies), these open source AI projects can dream big – and often deliver ground-breaking machine learning (ML) and AI advances.
Also important: the advances from these leading open source AI projects fuel the larger AI sector. That is, a new idea from this month’s AI project ends up next year (or even next month) in a high-end AI solution sold by a company.
Remember, if you know of additional top open source AI tools that should be on this list, please include them in the comments section below.
Open Source AI Projects
PyTorch has all the elements you’d expect from a leading open source AI project. It focuses on machine learning, arguably the most popular use of AI at this stage of the emerging technology’s growth. Even more important, developers and AI engineers can set PyTorch up on the top cloud computing platforms; PyTorch on AWS and PyTorch on Azure are both viable, as are Google Cloud and Alibaba. PyTorch offers neural networks, a foundational element of AI development.
Open Neural Network Exchange
Developed by Microsoft and Facebook, Open Neural Network Exchange offers some very powerful tools, most particularly the ability to recycle fully developed neural network models (which have spent hours and hours being trained in systems) into various other systems. In essence, the Open Neural Network Exchange greatly extends the usefulness of existing models by enabling this porting. Expect ONNX to grow ever more popular in the years ahead.
IBM’s AI Fairness 360
Bias in artificial intelligence algorithms is a growing concern, and AI Fairness 360 is an open source toolkit that addresses it. The tool provides algorithms that let a developer scan an ML model for potential bias, an essential part of fighting bias – and certainly a complex task. Importantly, AI Fairness 360 allows AI engineers to explore the algorithms throughout the development lifecycle. The tool can be set to work automatically. Built into the tool’s foundation is an architecture that checks for correlations: do the correlations create a prediction that suggests a harmful stereotype?
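One way to picture what scanning a model for bias involves is to compare how often each group receives a favorable prediction. The sketch below computes two widely used group fairness measures, statistical parity difference and disparate impact, in plain Python. It does not use the AI Fairness 360 API; the function names, group labels, and predictions are hypothetical.

```python
def selection_rate(predictions, groups, group):
    """Share of favorable (1) predictions among members of `group`."""
    picked = [p for p, g in zip(predictions, groups) if g == group]
    return sum(picked) / len(picked)

def statistical_parity_difference(predictions, groups, unprivileged, privileged):
    """Unprivileged selection rate minus privileged rate; 0 means parity."""
    return (selection_rate(predictions, groups, unprivileged)
            - selection_rate(predictions, groups, privileged))

def disparate_impact(predictions, groups, unprivileged, privileged):
    """Ratio of unprivileged to privileged selection rate; 1 means parity."""
    return (selection_rate(predictions, groups, unprivileged)
            / selection_rate(predictions, groups, privileged))

# Hypothetical model outputs (1 = favorable) for two groups, A and B.
predictions = [1, 1, 1, 0, 1, 0, 0, 0]
groups = ["A", "A", "A", "A", "B", "B", "B", "B"]

spd = statistical_parity_difference(predictions, groups, "B", "A")
di = disparate_impact(predictions, groups, "B", "A")
```

Here group A receives favorable predictions three times out of four and group B once out of four, so the difference is negative and the ratio falls well below 1, which would flag the model for closer review.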
Keras is a rarity in the world of AI open source projects: it promotes itself as “an API designed for human beings, not machines.” A Python deep-learning API, Keras interoperates with high-profile AI projects like Theano and Microsoft Cognitive Toolkit. Developers and AI engineers use it as an ML library to build prototypes with comparative ease. Also aiding its ease of deployment, Keras can run on a mix of processor hardware.
As the name suggests, Accord.NET uses the .NET framework. It’s a .NET machine learning framework that offers image and audio libraries coded in C#. It’s forward-looking, in that it offers a platform for developing commercial-level applications, including apps geared for signal processing, audio-visual toolsets and statistics apps. If you’re just getting your feet wet, Accord also includes template apps so you can start building faster.
Certainly an open source AI technology that’s generating buzz, Generative Pre-Trained Transformer 2 (GPT-2) was released by OpenAI in 2019. GPT-2 leverages a deep neural network, which uses numerous layers of software to process any number of inputs. GPT-2 is broadly known for handling text, from translation to creating text that, at its best, can be remarkably similar to that written by humans. Moreover, it’s a powerful learning tool that can synthesize and adapt to data with significant accuracy.
This project is useful if you’re an ML or AI developer who could use a helping hand with open source ML/AI projects. More of a learning tool than a project, Cheatsheets assists you in getting up to speed with AI/ML projects, from Keras to SciPy to PySpark to Dask. The instruction offered is in-depth and necessarily complex. While Cheatsheets AI is designed for “AI newbies,” in fact you will need some prior training to use this resource.
Is there a developer who doesn’t know TensorFlow? It’s practically a household name. Developed by the Google Brain team for internal use at Google, TensorFlow is now one of the most well-known open source machine learning platforms. Google is also making a cloud-based version of TensorFlow available for free to researchers.
Originally created by the bright minds at UC Berkeley, Caffe has become a very popular deep learning framework. Its claims to fame include expressive architecture, extensible code and speed.
With a huge user base, H2O claims to be “the world’s leading open source deep learning platform.” In addition to the Open Source version, the company also offers a Premium edition with paid support.
Microsoft Cognitive Toolkit
Clearly, Microsoft has moved into the world of open source. Formerly known as CNTK, the Microsoft Cognitive Toolkit promises to train deep-learning algorithms to think like the human brain. It boasts speed, scalability, commercial-grade quality and compatibility with C++ and Python. Microsoft uses it to power the AI features in Skype, Cortana and Bing.
Another very big name in AI and ML. Intended for use in AI research, DeepMind Lab is a 3D game environment. It was created by the DeepMind group at Google and is said to be especially good for deep reinforcement learning research.
Developed at Carnegie Mellon University, ACT-R is the name of both a theory of human cognition and software based on that theory. The software is based on Lisp, and extensive documentation is available. Operating Systems: Windows, Linux, macOS.
StarCraft II API Library
You didn’t think AI was all work, did you? Google’s DeepMind and Blizzard Entertainment are collaborating on a project that makes it possible to use the StarCraft II video game as an AI research platform. It’s a cross-platform C++ library for building scripted bots.
The Numenta organization offers numerous open source projects related to hierarchical temporal memory. Essentially, these projects attempt to create machine intelligence based on current biological understandings of the human neocortex.
A big ambition, to be sure: instead of focusing on a narrow aspect of AI such as deep learning or neural networks, OpenCog aims to create beneficial artificial general intelligence (AGI). The project is working toward creating systems and robots with the capacity for human-like intelligence.
This Java-based natural language processing software can identify the base forms of words, their parts of speech and whether they are names of companies or people, and it can normalize dates and times. It marks up the structure of sentences in terms of phrases and syntactic dependencies, indicates which noun phrases refer to the same entities, identifies sentiment, extracts particular or open-class relations between entity mentions and pulls out quotes. It was designed for English but also supports a wide array of other languages.
Developed and used by Facebook – yes, they have deep resources – Prophet forecasts time series data. It’s implemented in R or Python and is fully automatic, accurate, fast and tunable.
Originally an IBM Research project, SystemML is now a top-level Apache project. It describes itself as “an optimal workplace for machine learning using big data,” and it integrates with Spark.
Deep learning can be thought of as the furthest edge of AI. Theano, geared for deep learning, describes itself as “a Python library that allows you to define, optimize and evaluate mathematical expressions involving multi-dimensional arrays efficiently.” Key features include GPU support, integration with NumPy, efficient symbolic differentiation, dynamic C code generation and more.
Short for “Machine Learning Language Toolkit,” MALLET includes Java-based tools for statistical natural language processing, document classification, clustering, topic modeling, information extraction and more. It was first created in 2002 by faculty and graduate students at the University of Massachusetts Amherst and the University of Pennsylvania.
An example of cross-collaboration in the open source AI sector, DeepDetect has been used by organizations like Airbus and Microsoft. DeepDetect is an open source deep learning server based on Caffe, TensorFlow and XGBoost. It offers an easy-to-use API for image classification, object detection, and text and numerical data analysis.
The research paper titled “Towards Causal Representation Learning” describes how artificial intelligence systems can learn causal representations, and how the absence of such representations in machine learning algorithms and models gives rise to the challenges we face today.
Model interpretability is crucial if we want to use AI models to make high-stake decisions (e.g., making medical diagnoses, preventing suicides, etc.). In NLP, one common way to get interpretability is to extract information from the trained models. For example, some use gradient-based input attribution techniques…
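As a small illustration of gradient-based input attribution, the sketch below applies the gradient-times-input rule to a logistic model, where the gradient has a closed form. The model, weights, and input values are hypothetical; in NLP the same idea is typically applied to token embeddings, with gradients obtained by automatic differentiation.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def gradient_times_input(weights, bias, x):
    """Attribution per feature for the model p = sigmoid(w . x + b)."""
    p = sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)
    # For a logistic model, dp/dx_i = p * (1 - p) * w_i.
    grads = [p * (1.0 - p) * w for w in weights]
    # Scale each gradient by the feature value it belongs to.
    return [g * xi for g, xi in zip(grads, x)]

weights = [2.0, -1.0, 0.0]  # hypothetical trained weights
x = [1.0, 1.0, 5.0]         # hypothetical input features
attributions = gradient_times_input(weights, 0.0, x)
```

The first feature gets a positive attribution, the second a negative one, and the third gets zero even though its value is large, because the model's weight for it is zero.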
This guide to programming languages and their uses will help enhance students’ knowledge of data science-related languages and encourage them to make the right choice.
A set of 50 interview questions for Python. You never know what you might be asked.
THiNC is a deep learning framework that makes composing, configuring and deploying models easy. It provides a flexible yet simple approach to modelling by providing low-level abstractions of the training loop, the evaluation loop, and so on. Moreover, it plays well with major deep learning frameworks like TensorFlow and PyTorch. The functional programming API of THiNC is fairly simple and elegant. Its lightweight API makes THiNC a good option for quick prototyping and deployment of machine learning models.
The Best Machine Learning Frameworks & Extensions for Scikit-learn
We combine the powers of NVIDIA RAPIDS and Amazon SageMaker to accelerate hyperparameter optimization (HPO). HPO runs many training jobs on your dataset using different settings to find the best-performing model configuration.
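The core loop of HPO, running many jobs with different settings and keeping the best one, can be sketched with plain random search. This is a stand-in rather than the RAPIDS or SageMaker workflow: the search strategy, the `validation_loss` function (a hypothetical substitute for a real training job), and the parameter ranges are all assumptions.

```python
import random

def validation_loss(learning_rate, depth):
    """Hypothetical stand-in for a training job; lower is better."""
    return (learning_rate - 0.1) ** 2 + 0.01 * (depth - 6) ** 2

def random_search(num_jobs=50, seed=0):
    """Sample configurations at random and keep the one with the lowest loss."""
    rng = random.Random(seed)
    best_config, best_loss = None, float("inf")
    for _ in range(num_jobs):
        # Each iteration stands in for one independent training job.
        config = {
            "learning_rate": rng.uniform(0.001, 0.5),
            "depth": rng.randint(2, 12),
        }
        loss = validation_loss(**config)
        if loss < best_loss:
            best_config, best_loss = config, loss
    return best_config, best_loss

best_config, best_loss = random_search()
```

Because the jobs are independent of one another, they can run in parallel, which is where faster training per job translates directly into a faster search.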
In this blog, you will find some of the most frequently asked machine learning questions that every machine learning enthusiast has to answer one day. Let’s start.