'dataset' Forum Topics

2,634 Topics

	Topic Title
	Evaluating OpenAI GPT 4.1 for Text Summarization and Classification Tasks 2 Months Ago Share on Facebook Share on Twitter Share on LinkedIn On April 14, 2025, OpenAI released [GPT-4.1](https://openai.com/index/gpt-4-1/) — a model touted as the new state-of-the-art, outperforming GPT-4o on all major benchmarks. As always, I like to evaluate new LLMs on simple tasks like text classification and summarization to see how they compare with current leading models. In this article, I … Computer Science api artificial-intelligence-llm daniweb-api dataset finance github google-api mathematics python 2 0 91
	Question/Answering over SQL Data Using LangGraph Framework 7 Months Ago 2 Months Ago Share on Facebook Share on Twitter Share on LinkedIn This tutorial demonstrates how to build an AI agent that queries SQLite databases using natural language. You will see how to leverage the [LangGraph framework](https://www.langchain.com/langgraph) and the [OpenAI GPT-4o](https://openai.com/index/gpt-4/) model to retrieve natural language answers from an SQLite database, given a natural language query. So, let's begin without ado. ## … Computer Science artificial-intelligence-llm data-structure dataset file-stream python sql sqlite 2 1 709
	DeepSeek R1 vs Llama 3.1-405b for Text Classification and Summarization 3 Months Ago Share on Facebook Share on Twitter Share on LinkedIn In a [previous article](https://www.daniweb.com/programming/computer-science/tutorials/543028/text-classification-and-summarization-with-deepseek-r1-distill-llama-70b), I presented a comparison of [DeepSeek-R1-Distill-Llama-70b](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B) with the [DeepSeek-R1-Distill-Qwen-32B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B) for text classification and summarization. Both these models are distilled versions of the original DeepSeek R1 model. Recently, I wanted to try the original version of the DeepSeek R1 model using the DeepSeek API. However, I was … Computer Science api artificial-intelligence-llm daniweb-api data-science dataset github google-api python 1 0 165
	Text Classification and Summarization with DeepSeek R1 Distill Llama 70B 4 Months Ago 3 Months Ago Share on Facebook Share on Twitter Share on LinkedIn In the [last article](https://www.daniweb.com/programming/computer-science/tutorials/542973/benchmarking-deepseek-r1-for-text-classification-and-summarization#post2300447), I explained how you can use the [DeepSeek-R1-Distill-Qwen-32B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B) model for text classification and summarization problems. In this article, we will use the [DeepSeek-R1-Distill-Llama-70b](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B) for the same tasks. Following results from the [DeepSeek-AI's official paper](https://arxiv.org/pdf/2501.12948) show that `DeepSeek-R1-Distill-Llama-70b` outperform the other distilled models on 4 out of … Computer Science api artificial-intelligence-llm daniweb-api dataset github google-api pdf python 0 3 881
	Fine-tuning OpenAI Vision Models for Visual Question-Answering 8 Months Ago 5 Months Ago Share on Facebook Share on Twitter Share on LinkedIn In my previous article, I explained how to fine-tune [OpenAI GPT-4o model for natural language processing tasks](https://www.daniweb.com/programming/computer-science/tutorials/542333/how-to-fine-tune-the-openai-gpt-4o-model-the-wait-is-finally-over). In OpenAI DevDay, held on October 1, 2024, OpenAI announced that users can now fine-tune OpenAI vision and multimodal models such as GPT-4o and GPT-4o mini. The best part is that fine-tuning vision … Computer Science api artificial-intelligence-llm computer-vision daniweb-api data-science data-structure dataset github json os-x printer python 2 1 314
	Benchmarking DeepSeek R1 for Text Classification and Summarization 5 Months Ago Share on Facebook Share on Twitter Share on LinkedIn DeepSeek-R1 is a groundbreaking family of reinforcement learning (RL)-driven AI models developed by the Chinese AI firm [DeepSeek](https://www.deepseek.com/). It is designed to rival industry leaders like OpenAI and Google in complex decision-making and optimization problems. In this article, we will benchmark the DeepSeek R1 model for text classification and summarization … Computer Science api artificial-intelligence-llm daniweb-api dataset github google-api python 1 0 1K
	Qwen 2.5-72b Vs. Llama 3.3-70b for Text Classification and Summarization 6 Months Ago Share on Facebook Share on Twitter Share on LinkedIn Open-source LLMs are gaining significant traction due to their ability to match the performance of advanced proprietary LLMs. These models are free to use and allow users to modify their source code or fine-tune them on their own systems, making them highly versatile for various applications. Alibaba's [Qwen](https://www.alibabacloud.com/en/solutions/generative-ai/qwen?_p_lc=1) and Meta's … Computer Science api artificial-intelligence-llm daniweb-api dataset github open-source python 4 0 839
	Evaluating GPT-4o November Model for Text Classification and Summarization 7 Months Ago Share on Facebook Share on Twitter Share on LinkedIn On November 20, 2024, OpenAI updated its GPT-4o model, claiming it is more creative and accurate on several benchmarks. In this article, I compare the GPT-4o November update with the previous version (August update) for text summarization and classification tasks. By the end of this article, you will see whether … Computer Science api artificial-intelligence-llm daniweb-api dataset finance github google-api mathematics python 2 0 212
	Fine-tuning OpenAI GPT-4o for Multi-label Text Classification 7 Months Ago Share on Facebook Share on Twitter Share on LinkedIn In my previous article, I presented a [comparison of GPT-4o and Claude 3.5 Sonnet for multi-label text classification](https://www.daniweb.com/programming/computer-science/tutorials/542629/openai-gpt-4o-vs-claude-3-5-sonnet-for-multi-label-text-classification). The accuracies achieved by both models were relatively low. Fine-tuning is one solution to overcome the low performance of large-language models. With fine-tuning, you can incorporate custom domain knowledge into an LLM's … Computer Science api artificial-intelligence-llm daniweb-api data-science data-structure dataset finance json mathematics python 2 0 250
	OpenAI GPT-4o vs Claude 3.5 Sonnet for Multi-label Text Classification 7 Months Ago Share on Facebook Share on Twitter Share on LinkedIn In one of my previous articles, you saw a [comparison of GPT-4o vs. Claude 3.5 sonnet for zero-shot text classification](https://www.daniweb.com/programming/computer-science/tutorials/542132/comparing-gpt-4o-vs-claude-3-5-sonnet-for-zero-shot-text-classification). In that article; we performed multi-class text classification where input tweets belonged to one of the three categories. In this article, we will go a step further and perform zero-shot … Computer Science api artificial-intelligence-llm daniweb-api dataset finance google-api mathematics python 2 0 190
	Qwen vs Llama - Who is winning the Open Source LLM Race 8 Months Ago 8 Months Ago Share on Facebook Share on Twitter Share on LinkedIn Open-source LLMS, owing to their comparable performance with advanced proprietary LLMs, have been gaining immense popularity lately. Open-source LLMs are free to use, and you can easily modify their source code or fine-tune them on your systems. [Alibaba's Qwen](https://www.alibabacloud.com/en/solutions/generative-ai/qwen?_p_lc=1) and [Meta's Llama](https://ai.meta.com/blog/meta-llama-3-1/) series of models are two major players in … Computer Science api artificial-intelligence-llm daniweb-api dataset github open-source python 2 1 3K
	Text Classification and Summarization with Qwen 2.5 Model From Hugging Face 9 Months Ago Share on Facebook Share on Twitter Share on LinkedIn On September 19, 2024, [Alibaba released the Qwen 2.5 series of models](https://qwenlm.github.io/blog/qwen2.5/). The Qwen 2.5-72B base and instruct models outperformed larger state-of-the-art models like Llama 3.1-405B on multiple benchmarks. It is safe to assume that Qwen 2.5-72B is a state-of-the-art open-source large language model. This article will show you how … Computer Science api artificial-intelligence-llm daniweb-api daniweb-feedback dataset github google-api open-source python 3 0 2K
	Fine-Tuning OpenAI Whisper Model for Audio Classification in PyTorch 1 Year Ago 9 Months Ago Share on Facebook Share on Twitter Share on LinkedIn ## Introduction ## In a previous article, I explained [how to fine-tune the vision transformer model for image classification in PyTorch](https://www.daniweb.com/programming/computer-science/tutorials/540749/fine-tuning-vision-transformer-for-image-classification-in-pytorch). In this article, I will explain how to fine-tune the pre-trained OpenAI Whisper model for audio classification in PyTorch. Audio classification is an important task that can be applied … Computer Science audio computer-vision daniweb-feedback data-science dataset python 3 3 2K
	Extracting Structured Outputs from LLMs in LangChain 9 Months Ago Share on Facebook Share on Twitter Share on LinkedIn Large language models (LLMS) are trained to predict the next token (set of characters) following an input sequence of tokens. This makes LLMs suitable for unstructured textual responses. However, we often need to extract structured information from unstructured text. With the Python [LangChain](https://www.langchain.com/) module, you can extract structured information in … Computer Science algorithm artificial-intelligence-llm daniweb-feedback data-structure dataset engineering github python 2 0 218
	How to Fine-tune the OpenAI GPT-4o Model - The Wait is Finally Over 10 Months Ago Share on Facebook Share on Twitter Share on LinkedIn On August 20, 2024, [OpenAI enabled GPT-4o fine-tuning](https://openai.com/index/gpt-4o-fine-tuning/) in the OpenAI playground and the OpenAI API. The much-awaited feature is free for fine-tuning 1 million daily tokens until September 23, 2024. In this article, I will show you how to fine-tune the OpenAI GPT-4o model for text classification and summarization … Computer Science api artificial-intelligence-llm daniweb-api data-science data-structure dataset github json python 2 0 1K
	GPT-4o Snapshot vs Meta Llama 3.1 70b for Zero-Shot Text Summarization 10 Months Ago Share on Facebook Share on Twitter Share on LinkedIn In a previous article, I compared [GPT-4o mini vs. GPT-4o and GPT-3.5 Turbo for zero-shot text summarization](https://www.daniweb.com/programming/computer-science/tutorials/542208/gpt-4o-mini-vs-gpt-4o-vs-gpt-3-5-turbo-for-text-summarization). The results showed that the GPT-4o mini achieves almost similar performance for zero-shot text classification at a much-reduced price compared to the other models. I will compare Meta Llama 3.1 70b with OpenAI … Computer Science api artificial-intelligence-llm daniweb-api dataset github open-source python 2 0 1K
	Comparison of Fine-tuning GPT-4o mini vs GPT-3.5 for Text Classification 10 Months Ago Share on Facebook Share on Twitter Share on LinkedIn In my previous articles, I presented a [comparison of OpenAI GPT-4o mini model with GPT-4o and GPT-3.5 turbo models for zero-shot text classification](https://www.daniweb.com/programming/computer-science/tutorials/542182/gpt-4o-mini-a-cheaper-and-faster-alternative-to-gpt-4o). The results showed that GPT-4o mini, while significantly cheaper than its counterparts, achieves comparable performance. On 8 August 2024, OpenAI enabled GPT-4o mini fine-tuning for developers across … Computer Science api artificial-intelligence-llm daniweb-api daniweb-feedback data-science data-structure dataset json python 1 0 314
	GPT-4o mini vs. GPT-4o vs GPT-3.5 Turbo for Text Summarization 11 Months Ago Share on Facebook Share on Twitter Share on LinkedIn In my previous [article on GPT-4o mini](https://www.daniweb.com/programming/computer-science/tutorials/542182/gpt-4o-mini-a-cheaper-and-faster-alternative-to-gpt-4o), I compared the performance of GPT-4o mini against GPT-3.5 Turbo and GPT-4o for zero-shot text classification. We saw that GPT-4o mini, being 36% times cheaper, achieves only 2% less accuracy than GPT-4o. Furthermore, while being 1/3 of the price, the GPT-4o mini significantly … Computer Science api artificial-intelligence-llm daniweb-api data-science dataset github python 1 0 299
	GPT-4o mini - A Cheaper and Faster Alternative to GPT-4o 11 Months Ago Share on Facebook Share on Twitter Share on LinkedIn On July 18th, 2024, [OpenAI released GPT-4o mini](https://openai.com/index/gpt-4o-mini-advancing-cost-efficient-intelligence/), their most cost-efficient small model. GPT-4o mini is around 60% cheaper than GPT-3.5 Turbo and around 97% cheaper than GPT-4o. As per OpenAI, GPT-4o mini outperforms GPT-3.5 Turbo on almost all benchmarks while being cheaper. In this article, we will compare the … Computer Science api artificial-intelligence-llm daniweb-api dataset python 3 0 217
	Extracting YouTube Channel Statistics in Python Using YouTube Data API 11 Months Ago Share on Facebook Share on Twitter Share on LinkedIn Are you interested in finding out what a YouTube channel mostly discusses? Do you want to analyze YouTube videos of a specific channel? If yes, we are in the same boat. YouTube video titles are a great way to determine the channel's primary focus. Plotting a word cloud or a … Computer Science api daniweb-api data-science dataset google-api python 4 0 106
	Comparing GPT-4o vs Claude 3.5 Sonnet for Zero Shot Text Classification 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn On June 20, 2024, Anthropic released the [Claude 3.5 sonnet](https://www.anthropic.com/news/claude-3-5-sonnet) large language model. Claude claims it to be the state-of-the-art model for many natural language processing tasks, surpassing the [OpenAI GPT-4o model](https://openai.com/index/hello-gpt-4o/). My first test for comparing two large language models is their zero-shot text classification ability. In this article, … Computer Science api artificial-intelligence-llm daniweb-api dataset python 3 0 244
	Tabular Data Classification with Hugging Face Meta Tree Transformer 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn As a data scientist, I have extensively used the Hugging Face library for processing unstructured data such as images, text, and audio. My previous blogs have covered various transformer models for these types of data. Lately, however, I discovered that Hugging Face also provides transformer models for tabular data. One … Computer Science audio daniweb-feedback dataset machine-learning python 2 0 123
	Comparing Fine-tuned and Default GPT-3.5 Turbo for Text Classification 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn # Comparison Between Fine-tuned and Default GPT-3 Turbo for Text Classification In one of my previous articles, I showed you how to perform [zero-shot text classification using OpenAI GPT-4o and Meta Llama 3 models](https://www.daniweb.com/programming/computer-science/tutorials/542001/openai-gpt-4o-vs-meta-llama-3-for-zero-shot-text-classifiation). I used the default models for predicting sentiments of airline tweets. The default models perform substantially … Computer Science api artificial-intelligence-llm daniweb-api data-science data-structure dataset json python 2 0 662
	OpenAI GPT-4o vs Meta Llama 3 for Zero Shot Text Classifiation 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn On April 18, 2024, Meta AI released [Llama 3](https://ai.meta.com/blog/meta-llama-3/), which they claimed to be the most capable openly available LLM to date. Concurrently, OpenAI announced [GPT-4o (omni)](https://community.openai.com/t/announcing-gpt-4o-in-the-api/744700) on May 13, 2024, which is touted as the state-of-the-art proprietary model for various NLP benchmarks. As a guy who loves to compare … Computer Science api artificial-intelligence-llm daniweb-api data-science dataset google-api open-source python 2 0 264
	Claude 3 Opus Vs. Google Gemini Vs. GPT-4 for Zero-Shot Text Classification 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn On March 4, 2024, [Anthropic](https://www.anthropic.com/) launched the [Claude 3 family of large language models](https://www.anthropic.com/news/claude-3-family). Anthropic claimed that its Claude 3 Opus model outperforms GPT-4 on various benchmarks. Intrigued by Anthropic's claim, I performed a simple test to compare the performances of Claude 3 Opus, [Google Gemini Pro](https://deepmind.google/technologies/gemini/#introduction), and [OpenAI's GPT-4](https://openai.com/research/gpt-4) … Computer Science api artificial-intelligence-llm daniweb-api dataset file-stream google google-api json python 2 0 164
	Retrieval Augmented Generation (RAG) with Google Gemma From HuggingFace 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn In a previous article, I explained [how to fine-tune Google's Gemma model for text classification](https://www.daniweb.com/programming/computer-science/tutorials/541544/fine-tuning-google-gemma-model-for-text-classification-in-python). In this article, I will explain how you can improve performance of a pretrained large language model (LLM) using retrieval augmented generation (RAG) technique. So, let's begin without ado. ## What is Retrieval Augmented Generation … Computer Science api artificial-intelligence-llm daniweb-api data-science dataset github google google-api json open-source python 2 0 1K
	The Rise of AI Scams: Deciphering Reality in a World of Deepfakes 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn Discover the world of AI scams and find out how you can shield yourself against the cunning deceptions of deepfakes. ![deepfakes-deep-implications.jpg](https://static.daniweb.com/attachments/4/782a49e1fa4e86bd0bedf3957bec4df9.jpg) In an incident that underscores the alarming capabilities of artificial intelligence in the realm of fraud, a company in Hong Kong was [defrauded of $25 million](https://www.businessinsider.com/deepfake-coworkers-video-call-company-loses-millions-employee-ai-2024-2) earlier this year. … Community Center abuse artificial-intelligence-llm dataset 2 0 916
	Fine Tuning Google Gemma Model for Text Classification in Python 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn On February 21, 2024, Google released [Gemma](https://ai.google.dev/gemma), a family of state-of-the-art open-source large language models (LLMs). As per initial results, its 7b (seven billion parameter) version is known to perform better than Meta's [Llama 2](https://llama.meta.com/), the previous state-of-the-art open-source LLM. As always, my first test with any new open-source LLM … Computer Science artificial-intelligence-llm dataset open-source python 2 0 1K
	Using ChatGPT to Interact with Third-Party Applications in Python 1 Year Ago 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn Integrating language models like ChatGPT into third-party applications has become increasingly popular due to their ability to comprehend and generate human-like text. However, it's crucial to acknowledge the limitations of ChatGPT, such as its knowledge cut-off date in September 2021 and its inability to access external sources like Wikipedia or … Computer Science api artificial-intelligence-llm daniweb-api dataset python 3 2 1K
	Use of the Word ‘Tapestry’ in Web News More Than Doubled Last Year 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn Tracing AI-generated content in online news articles with corpus linguistics ![tapestry-header.JPG](https://static.daniweb.com/attachments/4/c8a5b32abaf78b39bdcb75f328580e4a.JPG) A query in the 'News on the Web' Corpus reveals that the use of the word 'tapestry' in online articles has more than doubled last year – from 3,085 instances in 2022 to 7,891 instances in 2023 “Today, we … Community Center artificial-intelligence-llm dataset social-media 0 0 362
	Comparing Google Gemini Pro with OpenAI GPT-4 for Zero-Shot Classification 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn In this article, we will compare two state-of-the-art large language models for zero-shot text classification: [Google Gemini Pro](https://deepmind.google/technologies/gemini/#introduction) and [OpenAI GPT-4](https://openai.com/research/gpt-4). Zero-shot text classification is a task where a model is trained on a set of labeled examples but can then classify new examples from previously unseen classes. This is … Computer Science api artificial-intelligence-llm daniweb-api daniweb-feedback dataset engineering file-stream google google-api json python 1 0 181
	Multilabel Text Classification using Hugging Face Models for TensorFlow 2 Years Ago 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn ## Introduction ## This tutorial explains how to perform multiple-label text classification using the [Hugging Face](https://huggingface.co/) transformers library. Hugging Face library implements advanced transformer architectures, proven to be state-of-the-art for various natural language processing tasks, including text classification. Hugging Face library provides trainable transformer models in three flavors: 1. Via … Computer Science api artificial-intelligence-llm daniweb-api dataset machine-learning python tensorflow 1 2 1K
	TensorFlow Keras Sequence Data Generator for Multimodal Classification 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn I recently tackled a challenging research task involving multimodal data for a classification problem using [TensorFlow Keras](https://www.tensorflow.org/guide/keras). One of the trickiest aspects was figuring out how to load multimodal data in batches from storage efficiently. While TensorFlow Keras offers helpful functions for batch-loading images from various sources, the documentation and … Computer Science daniweb-feedback data-structure dataset os-x python storage tensorflow 2 0 115
	Sentiment Analysis with Data Augmentation Using ChatGPT 1 Year Ago 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn Sentiment analysis, a subfield of Natural Language Processing (NLP), aims to discern and classify the underlying sentiment or emotion expressed in textual data. Whether it is understanding customers' opinions about a product, analyzing social media posts, or gauging public sentiment towards a political event, sentiment analysis plays a vital role … Computer Science api artificial-intelligence-llm dataset machine-learning python social-media 6 6 2K
	Multivariate Stock Price Prediction with Transformer Encoder in TensorFlow 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn In a [previous tutorial](https://www.daniweb.com/programming/computer-science/tutorials/541123/stock-price-prediction-using-1d-cnn-in-tensorflow-keras), I covered how to predict future stock prices using a deep learning model with 1D CNN layers. This method is effective for basic time series forecasting. Recently, I've enhanced this model by not just considering past closing prices but also factors like Open, High, Low, Volume, … Computer Science api apple daniweb-api daniweb-feedback data-science dataset google-api python tensorflow 0 0 173
	Custom Loss Functions in PyTorch: A Comprehensive Guide 1 Year Ago 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn ## Introduction ## Loss functions are the driving force behind all machine learning algorithms. They quantify how well our models are performing by calculating the difference between the predicted and actual outcomes. The goal of every machine learning algorithm is to minimize this loss function, thereby improving the model’s accuracy. … Computer Science algorithm dataset machine-learning python tensorflow 3 1 460
	Facial Emotion Detection with Vision Transformers and DeepFace Library 1 Year Ago 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn Facial emotion detection, as the name suggests, involves detecting emotions from faces in images or videos. Recently, I was working on a facial emotion detection task and came across the DeepFace library that implements various state-of-the-art facial emotion detection models. However, in my experience, the performance of the DeepFace library … Computer Science artificial-intelligence-llm computer-vision daniweb-feedback data-science data-structure dataset os-x programming-construct python 3 1 509
	Stock Price Prediction Using 1D CNN in TensorFlow Keras 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn Stock price prediction is a challenging task that requires analyzing historical trends, market sentiments, economic indicators, and company performance. One of the popular methods for stock price prediction is using deep learning models, such as convolutional neural networks (CNNs). CNNs are a type of neural network that can extract features … Computer Science apple audio daniweb-feedback dataset finance python tensorflow 1 0 581
	Video Classification using Hugging Face Transformers in PyTorch 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn In this tutorial, you will learn to fine-tune a [Hugging Face Transformers model](https://huggingface.co/docs/transformers/index) for video classification in PyTorch. The Hugging Face documentation provides an example of performing video classification using the Hugging Face Trainer with one of Hugging Face's built-in datasets. However, the process of fine-tuning a video transformer on … Computer Science artificial-intelligence-llm audio computer-vision daniweb-feedback data-science dataset os-x python video 2 0 438
	Fine Tuning Text Classification Models with Chat-GPT 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn In a previous article, I showed you [how to analyze sentiments using Chat-GPT and data augmentation techniques](https://www.daniweb.com/programming/computer-science/tutorials/540502/sentiment-analysis-with-data-augmentation-using-chatgpt#post2293643). Following that, some readers reached out, asking for a breakdown of fine-tuning a Chat-GPT model. In this article, I will guide you through fine-tuning your Chat-GPT model using your own data. First, I'll … Computer Science api artificial-intelligence-llm daniweb-api daniweb-feedback data-science dataset json programming-construct python 2 0 510
	Enhancing Language Models: Choosing Between RAG and Fine-Tuning 1 Year Ago 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn In my recent journey of developing various AI solutions powered by Language Models (LLMs), a significant question has emerged: Should we harness the capabilities of Retrieval Augmented Generation (RAG), or should we opt for the path of custom fine-tuning? This decision can profoundly impact the performance and adaptability of our … Computer Science artificial-intelligence-llm dataset engineering operating-system 4 1 851
	SQL Query Optimization: Combining Multiple Joins for Improved Performance 1 Year Ago 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn I'm working on an SQL query for a complex reporting system that involves multiple tables and joins. However, the query's performance is not meeting my expectations, and I suspect that the way I've structured my joins might be inefficient. Here's a simplified version of my query: SELECT orders.order_id, customers.customer_name, products.product_name, … Databases database-design dataset mysql sql 2 4 167
	Text Classification Using Data Annotation with ChatGPT 1 Year Ago 1 Year Ago Share on Facebook Share on Twitter Share on LinkedIn Data annotation for text classification is time-consuming and expensive. In the case of smaller training datasets, pre-trained ChatGPT models might achieve higher classification accuracy on test sets than training classifiers from scratch or fine-tuning existing models. Additionally, ChatGPT can aid in annotating data for fine-tuning text classification models. In this … Computer Science api artificial-intelligence-llm dataset machine-learning python 3 1 728
	Beginner - Code working for sample data only 2 Years Ago 2 Years Ago Share on Facebook Share on Twitter Share on LinkedIn Hi everyone, I'm new to Python. My manager wants me to run a Python code and generate output for 40 set of values. The code works fine for sample data. But when I replace it with actual data, it doesn't give me any output. Below is the code. Sample data … Software Development dataset python 1 1 158
	Translating CSV Files using DeepL and Pandas Dataframes in Python 2 Years Ago Share on Facebook Share on Twitter Share on LinkedIn ## Introduction ## In this tutorial, you will see how to convert the text in CSV file columns to other languages using the [DeepL API](https://www.deepl.com/translator) in the Python programing language. DeepL is one of the most popular and accurate text translation platforms. DeepL, as the name suggests, incorporates advanced deep … Computer Science api daniweb-api dataset github pdf python 1 0 985
	create coordinates on places between different points 2 Years Ago 2 Years Ago Share on Facebook Share on Twitter Share on LinkedIn I have a dataset with coordinates from all letterboxes in a certain area. I want to have all the letterboxes in a maximum of 500m away for everyone. So if there is a place without a letterbox nearby, I want to place there a letterbox (or more than one if … Programming dataset python 0 1 33
	Postprocessing Multilabel Ranked Annotations in Python 2 Years Ago Share on Facebook Share on Twitter Share on LinkedIn In my [previous articles](https://www.daniweb.com/programming/computer-science/tutorials/538512/finding-inter-annotator-agreement-between-three-annotators-in-python#post2287428), I explained how you could apply heuristic and statistical approaches for finding inter-annotator agreement between multiple annotators. However, while applying those approaches, I found that finding inter-annotator agreement in the case of multi-label ranked data is a difficult task, and traditional inter-annotator agreement techniques will almost … Computer Science data-science dataset python 0 0 121
	Statistical Approaches for Inter-Annotator Agreement with Pandas Dataframes 2 Years Ago Share on Facebook Share on Twitter Share on LinkedIn In my [previous tutorial](https://www.daniweb.com/programming/computer-science/tutorials/538512/finding-inter-annotator-agreement-between-three-annotators-in-python), I explained how I implemented heuristic approaches for finding inter-annotator agreement between three annotators. Heuristic approaches are excellent for understanding the degree of agreement between multiple annotators. However, you should back your analysis with statistical evidence. This is where statistical techniques for inter-annotator agreement come into … Computer Science data-science dataset python 2 0 567
	Finding Inter Annotator Agreement between three Annotators in Python 2 Years Ago Share on Facebook Share on Twitter Share on LinkedIn I recently worked on a research project where I had to find the inter-annotator agreement for tweets annotated by three annotators. Inter annotator agreement refers to the degree of agreement between multiple annotators. The quality of annotated (also called labeled) data is crucial to developing a robust statistical model. Therefore, … Computer Science dataset python 3 0 232
	Need Vehicle Dataset with Images in PHP Format 4 Years Ago 3 Years Ago Share on Facebook Share on Twitter Share on LinkedIn Hi Everyone! I am new here, and the main purpose of joining this community is getting some help. Actually, I am designing a PHP website where I need to fetch US vehicles data into PHP. I have already used this source to get the vehicle's dataset from here https://www.back4app.com/database/back4app/car-make-model-dataset. Still, … Databases dataset microsoft-office php 1 12 358

The End.

[dataset] Forum Topics

2,634 Topics