In logistic regression, the gradient descent algorithm can be used to fit the model, i.e. to determine the coefficients and the intercept inside the sigmoid function.
This post implements gradient descent in Python, using NumPy for the matrix computations to solve a logistic regression model, and finally checks the result against the official scikit-learn example code to verify its correctness.
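For reference, these are the standard textbook formulas that the code below implements (restated here only to make the NumPy lines easier to map; nothing beyond ordinary logistic regression is assumed):

z = \theta^T x, \qquad \hat{y} = \sigma(z) = \frac{1}{1 + e^{-z}}

J(\theta) = -\sum_i \big[ y_i \log \hat{y}_i + (1 - y_i) \log(1 - \hat{y}_i) \big]

\nabla_\theta J = X^T (\hat{y} - y)

The last expression is what pd_j2theta_arr computes, and the parameter update is \theta \leftarrow \theta - \eta \, \nabla_\theta J, where \eta is the learning rate lr in the code.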
# !/usr/bin/python
# -*- coding: utf-8 -*-
"""
PROJECT_NAME = Datawhale
Author : sciengineer
Email : 821072960@qq.com
Time = 2021/7/22 18:09
"""
import numpy as np
# Generate a toy dataset: it's just a straight line with some Gaussian noise:
xmin, xmax = -5, 5
n_samples = 100
np.random.seed(0)
X = np.random.normal(size=n_samples)
y = (X > 0).astype(np.int64)
X[X > 0] *= 4
X += .3 * np.random.normal(size=n_samples)
X = X[:, np.newaxis]
# learning rate (step size for gradient descent)
lr = 0.01
# gradient descent algorithm
def grad_desc(x_ndarr, y_ndarr):
    """
    :param x_ndarr: ndarray of features (for example, 422 samples with 3 features,
                    then the shape of x_ndarr is (422, 3))
    :param y_ndarr: ndarray of labels (for example, 422 samples with 1 label each,
                    then the shape of y_ndarr is (422,))
    :return: an ndarray containing the intercept and coefficients of the logistic regression model
    """
    # in order to use matrix multiplication, add a column of ones (x0) for the intercept term
    x0 = np.ones(x_ndarr.shape[0])
    x_ndarr = np.insert(x_ndarr, 0, x0, axis=1)
    theta_arr = np.zeros(x_ndarr.shape[1])
    # to initialize j_loss_last with a reasonable value, the loss of the first iteration
    # of the loop below was inspected, and j_loss_last was set to the same order of magnitude
    j_loss_last = 1e2
    delta = 1e-10
    for i in range(10 ** 10):
        # z = theta0 * x0 + theta1 * x1 + theta2 * x2 + ... + thetan * xn
        z = np.dot(x_ndarr, theta_arr)
        y_hat = 1 / (1 + np.exp(-z))
        # clip y_hat into (0, 1) to avoid the warnings "divide by zero encountered in log" and
        # "invalid value encountered in log", which would otherwise ruin the optimization
        y_hat = np.clip(y_hat, delta, 1 - delta)
        # cross-entropy loss
        j_loss = -np.dot(y_ndarr, np.log(y_hat)) - np.dot((1 - y_ndarr), np.log(1 - y_hat))
        delta_j_loss = j_loss_last - j_loss
        rate = abs(delta_j_loss / j_loss)
        # partial derivative of j_loss with respect to theta_arr
        pd_j2theta_arr = np.dot(y_hat - y_ndarr, x_ndarr)
        # theta_arr is updated on each iteration
        theta_arr = theta_arr - lr * pd_j2theta_arr
        j_loss_last = j_loss
        # the relative change of the loss is used as the convergence condition
        if rate < 5 * 1e-10:
            break
    return theta_arr
theta_arr = grad_desc(X, y)
# the coefficients: theta1, theta2, ..., thetan
print('Coefficients: \n', theta_arr[1:])
# the intercept: theta0
print('Intercept: \n', theta_arr[0])
Output:
Coefficients:
[6.90478134]
Intercept:
-1.6481480918181262
Only the beginning of this official scikit-learn demo is kept here; the point is simply to verify that the hand-written logistic regression implementation is correct.
# !/usr/bin/python
# -*- coding: utf-8 -*-
# Code source: Gael Varoquaux
# License: BSD 3 clause
import numpy as np
import matplotlib.pyplot as plt
from sklearn import linear_model
from scipy.special import expit
# Generate a toy dataset: it's just a straight line with some Gaussian noise:
xmin, xmax = -5, 5
n_samples = 100
np.random.seed(0)
X = np.random.normal(size=n_samples)
y = (X > 0).astype(float)
X[X > 0] *= 4
X += .3 * np.random.normal(size=n_samples)
X = X[:, np.newaxis]
# Fit the classifier
clf = linear_model.LogisticRegression(C=1e10)
clf.fit(X, y)
print('Coefficients: \n', clf.coef_)
print('Intercept: \n', clf.intercept_)
Output:
Coefficients:
[[6.90879439]]
Intercept:
[-1.64913083]
As you can see, the logistic regression implemented by hand in Python gives the same result as the scikit-learn library. Of course, this implementation certainly has plenty of room for optimization, and discussion is welcome.
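As a further sanity check, here is a minimal sketch (assuming both snippets above were run in the same session, so that theta_arr from grad_desc, the fitted clf, and the toy dataset X are all still defined) that compares the predicted probabilities of the two models:

import numpy as np

# hand-written model: probability of class 1 via the sigmoid
z = theta_arr[0] + X[:, 0] * theta_arr[1]
p_manual = 1 / (1 + np.exp(-z))
# sklearn model: probability of class 1
p_sklearn = clf.predict_proba(X)[:, 1]

print('max abs difference in probabilities:', np.max(np.abs(p_manual - p_sklearn)))
print('predicted labels agree:', np.array_equal((p_manual > 0.5).astype(float), clf.predict(X)))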
Author: 天使是怎样炼成的
Link: http://www.pythonpdf.com/blog/article/490/af8b995f391516549b1e/
Source: 编程知识网